PART I


DEVELOPMENT 



SECTION 1 


INTRODUCTION 

by 

Louis Sutro 

The first of the two efforts described in this report is supported by the 
Bioscience Division of the Office of Space Sciences and Applications, NASA, and 
is concerned with communicating pictorial data from a remote location, such 
as Mars, to Earth. The second, supported by the Electronics Research Center 
of NASA, is to develop methods of making decisions at remote locations. Both 
are instrumentation efforts applicable to a Mars rover and both use biology as 
a source of ideas. 

Earlier reports (1, 2) indicated that the processing of television pictures at
a remote location was desirable, and could be facilitated by employing a "rover,"
that is, a pair of stereo television cameras on a mobile base. Since further on-
board processing was required to position the cameras, the concept evolved of
an assembly, to be called "robot," of stereo cameras, vehicle, and computer.
Included in the concept are inputs to the computer from senses other than the
visual, namely, accelerometers to indicate the inclination of the cameras and
their motion, and a tactile-kinesthetic sense of both the posture of the robot and
its contact with the ground.

The word "computer" has been used in two ways in this and earlier reports.
As illustrated above, it is used first to describe all equipment employed by the
robot for processing or computation. For testing stationary equipment this
computer will be the general-purpose machine described in subsection 2.7; for
testing mobile equipment it will be special-purpose and transportable. This
report presents those specifications presently available for the mobile system,
while more detailed specifications are being prepared that will lead to the size,
weight and power estimates necessary to judge practicality.

The second use of the word "computer" is to describe the kinds of compu- 
tation performed by animal nervous systems. Section 7 introduces this second 
use of the word. 
Each of the two efforts is being approached from the points of view of both
development and research. Although the first six sections of the report, in
general, describe development and Sections 7 through 9 describe research
into the nature of animal sensory, decision and control systems, the line between
research and development cannot be sharply drawn.

The sections on development consider first, in Section 2, the tying together
of the subsystems. Sections 3, 4 and 5 describe three aspects of the visual sub-
system: camera-computer chains, processing, and representation of the environ-
ment. Section 6 describes additions being made to the decision subsystem.

The first of the research sections — Section 7 — describes Dr. Warren 
McCulloch's concept of the nervous system as interconnected computers. Section 
8 develops mathematically a concept of visual processing, known as lateral inter- 
action, which appears basic to the functioning of higher animal retinas. Finally, 
Section 9 describes the theory behind animal learning, a body of knowledge 
which must be understood if an engineering system such as that in Section 2 is to 
adapt to changes in its environment. Section 10 is a summary and conclusion. 
SECTION 2


SUBSYSTEMS AND THEIR COORDINATION 
by 

Louis Sutro, William Kilmer and Joseph Convers 


2.1 General

The system whose parts are described here includes the robot, means of
transmitting pictorial and scientific data to Earth, and equipment for decoding
it and displaying it to an Earth operator. Subsystems of the robot are outlined
in subsection 2.2, after which follow a description of the LOOK mode of opera-
tion and proposed means of mounting the visual subsystem on the mobility
subsystem. Coordination of subsystems is described in principle in subsection
2.6 and simulation within a general-purpose computer in subsection 2.7.

2.2 Robot Subsystems 

The visual subsystem includes a camera assembly of the kind described in 
Section 3, processing of the kind described in Section 4, and the making and 
interrogating of a model of visual and tactile space described in Section 5. 

Equipment intended to contact the environment either to report on it or to 
affect it is called the contact subsystem — examples are the hand and arm 
assemblies shown at the left center of Fig. 2-1. Since vision, in many ways, 
is a distant method of feeling the shape of things, coordination is required of
the visual and contact subsystems. This appears achievable either by a match- 
ing of models of the kind described in Section 5 or by the learning mechanisms 
described in Section 6. 

The mobility subsystem considered during this report period is the vehicle 
developed by General Motors for a landing on the moon and now being redesigned 
for a landing on Mars. Following their practice, this subsystem includes the 
basic structural elements, wheels, wheel drives and steering actuators as 
illustrated by Fig. 2-2. 

Fig. 2-1. Subsystems of the robot proposed for a Mars landing.

Fig. 2-2. Typical vehicle developed for lunar and planetary exploration by the
Defense Research Laboratory, AC Electronics Division, General
Motors. Length is 72 in., width 36 in., height of wheels 18 in.

The fourth component, the decision subsystem, determines the mode in
which the robot operates. For initial tests, only the following four modes will
be possible:

Look

Move

Rest

Maintenance

These are mutually exclusive. For example, the robot cannot LOOK and MOVE 
at the same time because the cameras are not inertially stabilized during vehicle 
motion. The REST mode is planned for night, when lack of light makes looking 
and therefore movement difficult. MAINTENANCE could use the wheels to check 
for vehicular trouble or use the cameras either to check their own operation or 
check other equipment, but the intent will not be the same as the LOOK or MOVE 
modes. 
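
The mutual exclusion of the four initial modes can be sketched in a few lines. This is only an illustration of the idea, not the decision subsystem itself; the names `Mode`, `Robot` and `command` are hypothetical and do not come from the report.

```python
from enum import Enum


class Mode(Enum):
    """The four initial operating modes named in the text."""
    LOOK = "look"
    MOVE = "move"
    REST = "rest"
    MAINTENANCE = "maintenance"


class Robot:
    """Hypothetical holder of exactly one operating mode at a time."""

    def __init__(self) -> None:
        # Starting in REST is an arbitrary choice made for this sketch.
        self.mode = Mode.REST

    def command(self, new_mode: Mode) -> Mode:
        # A single attribute holds the mode, so entering one mode
        # necessarily leaves the previous one: LOOK and MOVE can never
        # be active together.
        self.mode = new_mode
        return self.mode
```

Holding the mode in one variable, rather than in one flag per mode, makes an illegal combination such as LOOK-and-MOVE unrepresentable.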

The robot will have other modes of operation: one to try to get it out of a
difficulty such as wedging a wheel between rocks, others to perform specified
tasks or experiments. Yet the robot is not planned to be autonomous. The top
command remains on Earth.

The upward pointing arrows in Fig. 2-1 represent connections that can be 
made to the telecommunications subsystem, which in a test system being planned 
would be a microwave link to a nearby control station. A high-gain directional 
antenna and an omni-directional antenna, both intended for this purpose, are 
shown in Fig. 2-2.
2.3 The LOOK Mode of Operation


The LOOK mode of operation may take one of these forms: to look for a 
passage ahead for the robot to follow, to look for objects having specified proper- 
ties of scientific interest, or to look at the scene or object to record pictures 
for transmission to Earth. 

Figure 2-3 indicates how a scene might appear on Mars in two successive 
looks (the two are separated by the vertical line at the center of the illustration). 
At the center of each look is a high-resolution central field of the kind to be 
achieved by camera assembly E, described in subsection 3.10. Around it is the 
low-resolution peripheral field. In this illustration the ratio of the central field 
to the peripheral is made 1/5, whereas in camera E it is planned to be 1/10. 

The LOOK mode begins with the levelling of the platform shown in Fig. 2-4. 
Possible next steps are the pitching of the cameras to scan the lowest line of the 
nearest rectangle; then turning of the cameras one position at a time, so that their 
point of convergence steps along a line the width of the rectangle. The illustration 
shows the cameras in the position where they receive images from the same point 
on the rock. If this image is an edge, as here, it can be considered a pointer in 
the direction to be explored next. Following such pointers, the cameras trace the 
edge of the rock. Then, converging further away, they can sweep the second rect- 
angle to find the hole in the ground. Converging still further away, they can sweep 
the farthest rectangle to find the edge of the cliff. Since this process may be dif- 
ficult to realize if images of the scene are crowded, other approaches are being 
studied. 

Points as well as lines may define a surface. Experiments conducted at 
Lincoln Laboratory, MIT, indicate that white dots against a black background, 
located in both the left and right views of a stereoscopic picture, can perform 
the same function as a set of lines. Such dots could also be black against a 
light background or merely contrast with the adjoining area. Thus in the dis- 
play on Earth a surface such as Fig. 2-4 could be defined by points without 
connecting lines. 

2.4 Mounting a Stereo Pair of Cameras on a Vehicle

Figure 2-2 shows the proposed General Motors vehicle to which equipment
is planned to be attached. A vehicle of similar but earlier design was delivered 
to Jet Propulsion Laboratory and is operating there. 

Two approaches have been made to the mounting of a stereo pair of cameras
on such a vehicle. One is to support the stereo pair, as in Fig. 2-5, on a boom,
which can be raised to permit looking in the distance or lowered to permit look-
ing immediately in front of the vehicle. The other approach is to mount the pair
as close to the ground as possible, as in Fig. 2-6. For both approaches it is de-
sirable, after vehicle movement, to level the camera platform and then maintain
the pitch axis of the cameras horizontal as they look up or down, sideward, or
backward.

[Fig. 2-3. Caption only partly legible: "... looks surrounded by low resolution look ..."]

Fig. 2-4. Discrimination of objects by a stereoscopic pair of TV cameras and a computer.

The first approach required more mechanical complexity than appeared wise 
at the start. The second placed the weight of the camera assembly too far for- 
ward. Figure 2-7 is a compromise, placing the camera assembly above the axle. 

2.5 Contact Subsystem 

If recognition of objects is to be performed as it is in animals, more than 
one sensory input is required. While information provided by a single sense is, 
in general, inadequate for recognition, the addition of a second sense generates 
what is necessary to validate the original evidence. With the integration of the 
contact and the visual subsystems, a reliable recognition capability should be 
achieved. 





Fig. 2-5. Use of a boom (crane) to elevate stereo pair of cameras.

Fig. 2-6. Camera assembly E mounted in front of forward axle of General Motors
six-wheeled vehicle.

Fig. 2-7. Stereo pair of cameras mounted on front axle.





Figure 2-6 shows the first attempt to include a hand-and-arm assembly in the
system. Each hand is at the end of an arm which is supported through a joint
above a front wheel. Two objections to this arrangement are the possible conflict
of a front wheel with an arm and the mechanical difficulties of cantilevering a
long arm with precision. The design of Fig. 2-7 lessens these difficulties by put-
ting the wagon-type steering at the rear as in a ship, and the shoulders both lower
and nearer the center. The hand, as presently designed, contains a line of spring-
loaded pins with a shaft position encoder on each. From the line array can be
determined the relative displacement of the pins and hence the contour of an object.
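
How a contour might be read from the line of pins can be sketched as follows. The function name, the encoder scale factor `counts_per_mm` and the choice of the least-depressed pin as the reference are all assumptions made for illustration; the report does not specify them.

```python
def contour_from_pins(encoder_counts, counts_per_mm):
    """Convert one encoder reading per pin into a contour.

    Each reading is scaled to a displacement, then expressed relative to
    the least-depressed pin (an assumed reference), so the result depends
    only on the shape of the object and not on how hard the hand presses.
    """
    displacements = [count / counts_per_mm for count in encoder_counts]
    reference = min(displacements)
    return [d - reference for d in displacements]
```

For example, readings of 10, 30, 50, 30, 10 counts at 10 counts per millimeter describe a ridge 4 mm high under the middle pin.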

The following elements feed data to or receive data from other subsystems: 

a) Tactile input of the hands. 

b) Kinesthetic input of each joint. 

c) Motors which position the hand for touching. 

2.6 Coordination of the Subsystems 

The checking of one sensory subsystem by others should partially overcome 
inadequacy or failure. For example, the contact subsystem can reexamine the 
shape and location of objects detected by the visual subsystem. The visual hori- 
zon might be used to validate the inclination of the cameras as reported by the ac- 
celerometers. 

Checking one subsystem by another would be of no use without means of
resolving differences between them. This is the function of the decision sub-
system (3, 4), whose principal component is the decision computer. Since this sub-
system will be more effective with more kinds of sensory input, studies are being
made to add hearing, the sensing of thermal radiation and sensors of internal
conditions, such as power, temperature, and the progress of tasks being carried
out. (See Fig. 2-1.) Methods of having the decision subsystem learn are also
being studied and are reported in Section 6.

Thus Fig. 2-1 might be curled on itself so that the emergency reactions and
tasks, shown at the right, relate to the visual, contact and mobility subsystems
at the left.

2.7 Digital Computers

Each of the boxes in Fig. 2-1 is being developed as a program for a general-
purpose digital computer, although not for the same computer. The visual and
contact subsystems are being developed in the PDP line of computers described
below, the decision subsystem in the IBM 360 Model 75. The latter is the main
facility at the Instrumentation Laboratory and is provided with a very facile
compiler, the M.I.T. Algebraic Compiler. However, pressure to use this
machine is so great that attachment of external equipment such as cameras and
arm-and-hand assemblies has not been permitted.

It is the intention of this effort to obtain first a combination of commercial
machines in which all programs can be run and all equipment attached, then a
single machine of equal facility but small and light enough to be carried on a
vehicle such as that of Fig. 2-7. The combination of machines in which all
programs can be run and all equipment attached is expected to be a Digital
Equipment Corp. PDP-9 to be delivered to the Instrumentation Laboratory in
July of this year, and a larger computer to be connected to it by telephone lines.

As presently planned, the larger computer would contain both the relatively
slow formation of the model of the environment and its interrogation (Section 5)
and the decision subsystem (Section 6). The PDP-9 would perform the faster
interpretation of the output of the cameras (Section 4) and the commanding of
their motion. The decision subsystem will continue to be developed in the IBM
360 Model 75 while a copy of it is programmed for the larger of the two tied-to-
gether computers.

In the interval between ordering and delivery of the PDP-9, a machine with
similar instruction code, the PDP-7 in Project MAC, was made available for
evening and off-hour daytime use. The programming reported in Section 4 was
carried out in this machine. Finally, TX-0 is not being considered because its
slow memory access rate is poorly matched to the high output rate of all cameras
presently under development.
SECTION 3


CAMERA-COMPUTER CHAINS 

by 

Louis Sutro, David Tweed, Joseph Convers and Charles Sigwart 


3.1 Objectives 

The primary task of the visual subsystem is to sense the environment,
that is, to obtain pictures of the area surrounding the cameras. Because of the
time lag in communication and the low rate of transmission between the robot and
Earth, it is also necessary that the robot construct a model of the environment,
that is, an internal representation that can be interrogated to help in guidance
and experimentation. As visual data flows from the cameras through successive
transformations to the internal representation, it can be tapped at any point and
transmitted to Earth. Thus, several choices will exist regarding not only what
pictures are to be sent but also what will be their level of abstraction.

A prerequisite for dealing with visual data is a means of determining range,
such as obtaining two views from two vantage points. Engineers call such an ar-
rangement stereoscopic, while psychologists argue that stereoscopic (stereo) vision
is not only range finding on one point at a time but determination of the relative
range of every point in the entire field for which there is a disparity between the
left and right views (5). Camera-computer chains for the purpose of finding the
range of one point at a time are considered later in this section.

Beginning with subsection 3.2, the subjects considered are cameras A and
B, both monocular. Then follow descriptions of three of the four types of stereo
optics being considered for the visual subsystem, namely, the C, D and E. (A
fourth type developed by Philco-Ford has been described elsewhere.) This
section then shows how the stereoscopic camera assembly and its electronics
can be tied to a computer to interpret its output and direct it. An analysis of
the errors inherent in each camera type is presented.
Since descriptions of hardware and software can be conveniently separated,
this section will dwell on camera-computer chain A, operated in the summer of 
1966, and chains C, D and E presently under development; while discussion of 
the programming to be employed in these chains is reserved for Section 4. 

A possible method of modelling the environment is presented in Section 5. 

Nonetheless, the subject material of the three sections has been conceived 
as a single unit. For example, the window, employed for detailed examination 
of the image, is formed by counters that open or close gates at a window frame 
superimposed either on the scene or on the stored image. The counters will be 
either special-purpose electronics (hardware) or a routine within a general- 
purpose computer (software). 
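
The software form of the window can be sketched as a pair of counters that gate picture elements in or out of a rectangular frame. The function below is a hypothetical illustration operating on a stored image held as a list of rows; the parameter names are not taken from the report.

```python
def window(image, top, left, height, width):
    """Return the part of a stored image that lies inside the window frame.

    The row and column counters play the role of the gates: an element is
    passed only while both counters are inside the frame.
    """
    selected = []
    for row_count, row in enumerate(image):
        if not (top <= row_count < top + height):
            continue  # vertical gate closed for this line
        selected.append(
            [value for col_count, value in enumerate(row)
             if left <= col_count < left + width]  # horizontal gate
        )
    return selected
```

The same counting logic could equally be built as special-purpose electronics acting on the scene as it is scanned, which is the hardware alternative the text mentions.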

All of the cameras described in this section employ as their transducer the 
slow-scan vidicon (Type 1343-018) developed by the General Electrodynamics 
Corp. of Garland, Texas, for the Mariner and Surveyor spacecraft. It is being 
used to obtain the data reported here and may be replaced by solid-state trans- 
ducer arrays as these become available. 

3.2 Camera-Computer Chain A

Camera chain A was monocular; that is, it employed a single optical train.
Built in 1964 and improved in the spring of 1966, it was linked to the TX-0
computer in the summer of 1966 for experimentation in the detection of motion.
(See Fig. 3-1.) The camera had been designed at Lincoln Laboratory, MIT, to
be operated as fast as its output could be loaded on an analog tape (500,000 Hz).
After loading, the tape was to be run at a slower speed, so that its contents
could be converted from analog to digital form and stored on a second tape for
computer processing.

In its use for the Instrumentation Laboratory, the camera was required to 
respond to four times as large a raster at the same resolution. Given the same 
dwell time per position in the raster, this would require that the video ampli- 
fier have four times as wide a passband as was required for the Lincoln Labora- 
tory application. 

In the actual test, due to the unavailability of high-speed analog tape, the
camera was operated at the speed of digital tape, with the result that signals
passed through the video amplifier at frequencies below the passband. Lowering
the bandpass through partial modification of the video amplifier could not
prevent noise from appearing in the data, and construction of the amplifier on
printed cards within a tubular case made further modification difficult. Since
Camera C was already in development, further work on Camera A was dropped.

Fig. 3-1. Camera-computer chain A. The computer is the TX-0 in the Department of Electrical Engineering, MIT.

In spite of the difficulties, camera-computer chain A was operated by a
graduate student, leading to a thesis on motion detection. The study consisted
of recording in the TX-0 computer pairs of television frames, comparing corre-
sponding positions in them and generating a "decision map" of changes in
luminance. To lessen the effect of noise the computer was programmed to
"reduce" the decision map by rejecting isolated changes.

Further experiments in motion detection have been deferred for the present. 
It is assumed that the method employed in camera-computer chain A could be 
employed also in succeeding designs. 
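
The frame-comparison method of chain A can be sketched as below. This is not the thesis program: the threshold test and the rule used to call a change "isolated" (no changed element among its eight neighbours) are assumptions chosen for illustration, and the function names are hypothetical.

```python
def decision_map(frame_a, frame_b, threshold):
    """Mark with 1 every position whose luminance changed by more than
    `threshold` between two frames of equal size."""
    return [
        [1 if abs(a - b) > threshold else 0 for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(frame_a, frame_b)
    ]


def reduce_map(dmap):
    """Reject isolated changes: keep a 1 only if at least one of its
    eight neighbours is also 1 (assumed definition of 'isolated')."""
    rows, cols = len(dmap), len(dmap[0])
    reduced = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if not dmap[r][c]:
                continue
            neighbours = sum(
                dmap[rr][cc]
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
                if (rr, cc) != (r, c)
            )
            reduced[r][c] = 1 if neighbours > 0 else 0
    return reduced
```

A single noisy element changes the decision map but not the reduced map, whereas a moving object, which alters several adjacent positions, survives the reduction.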

3.3 Camera-Computer Chain B

Chain B was an assembly to implement motion detection in hardware. Un- 
developed beyond the sketching stage, it was to consist of two camera tubes, one 
of fast response, the other slow, both examining the same scene through a beam 
splitter. The outputs of the two tubes were to be compared to detect motion. 

3.4 Designation of Stereo Optics and Camera Electronics

While camera-computer chain A was being assembled, design began on
stereo optics and continued on camera electronics. It became convenient to em-
ploy a different designation for each of these lines of development. Thus, the
basic type of the stereo optics is represented by a letter, e.g., C, D, E; a
variant of the basic type by a letter and a number, e.g., C1, C2; and the elec-
tronics by a second number separated from the letter and first number by a
dash. (See Table 3-1.)
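
The designation scheme is regular enough to be read mechanically. The sketch below splits a designation such as C1-1 into its three parts; the function name and the dictionary keys are hypothetical conveniences, not terms from the report.

```python
import re

# Letter = basic optics type, first number = variant,
# number after the dash = camera electronics type.
_DESIGNATION = re.compile(r"^([A-Z])(\d+)?(?:-(\d+))?$")


def parse_designation(text):
    """Split a designation like 'C1-1' into optics type, variant and
    electronics type. Parts that are absent are returned as None."""
    match = _DESIGNATION.match(text)
    if match is None:
        raise ValueError(f"not a camera-chain designation: {text!r}")
    optics, variant, electronics = match.groups()
    return {
        "optics_type": optics,
        "variant": int(variant) if variant else None,
        "electronics": int(electronics) if electronics else None,
    }
```

Thus D1-3 reads as optics type D, variant 1, electronics Type 3, while a bare E names only the basic optics type.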

Type C stereo optics is an arrangement of fixed mirrors to bring two sepa-
rated optical paths onto a single vidicon. Type C1 is an attachment for a Bolex
16 mm moving-picture camera that can also be attached to a television camera.
As shown in Fig. 3-3, the axes of its two optical paths are parallel. A possible
variant is a configuration with fixed convergent axes.

Stereo optics types D and E are characterized by both variable convergence
and ability to mechanically scan a field of view larger than that of the lenses.
For reasons given in subsection 3.8, it was decided to provide two cameras in
each of these types and turn and tilt them to achieve the variable convergence
and mechanical scanning. Type E stereo optics provides each camera with both
a wide-angle coarse-resolution field and one of narrow-angle and high-resolution.
In the Type E each camera is capable of the motions provided for the D, plus
pitch, roll and yaw motions to erect the pair of cameras from a tilted base.

Table 3-1. Components of camera-computer chains.

ITEM                                                  CAMERA-COMPUTER CHAIN
NO.   COMPONENTS                                      A     C1-1   D1-3   E1-3

1     Camera for A                                    X
2     TX-0 Computer                                   X
3     Opto-mechanical configuration Type C1                 X
4     Camera electronics Type 1                             X
5     PDP-9 Computer                                        X      X      X
6     Opto-mechanical configuration Type D1                        X
7     Camera electronics Type 3                                    X      X
8     Three gimbals, their torquers and resolvers                  X
9     Six gimbals, their torquers and resolvers                           X

3.5 Type 1 Camera Electronics 

Experience with Camera A led to the design of the Type 1 camera electronics,
which is planned to be used with the Type C1 stereo optics. The resulting camera
chain is called C1-1. Type 2 camera electronics is a commercially available
system, on order. Type 3 is the electronics of the two cameras of the D con-
figuration, assumed here to have the characteristics of the Type 1. For purposes
of comparison, Type 3 electronics is assumed to be employed also with camera
assembly E1, although another type may be employed yielding twice the aperture
response (resolution).

When the field of view is imaged on the face of the vidicon, the brightness or
luminance of each small angle in this view causes an appropriate change in the
resistance of the photoconductive surface that modulates the video output wave-
form. The purposes of the electronics involved, as indicated in Fig. 3-2, are:

a. To deflect the electron beam of the camera tube horizontally 512
positions and vertically 512 lines so as to describe a raster. Rate
of advance of the beam should be a multiple of the computer clock rate.

b. To amplify the video signal and convert its amplitude at each raster
position into a digital word to be entered into the computer.

While the electron beam does not actually dwell at each position, it is useful to
think of it as doing so.

Fig. 3-2. Block diagram of Type 1 camera electronics.

Deflection of the beam along one line of the raster is achieved by generating
a ramp waveform in the horizontal sweep generator shown at the top center of
Fig. 3-2. The beginning and end of the ramp are determined by the horizontal
counter, which counts from 0 to 511 at twice the computer clock rate. This
multiple is chosen because two six-bit words can be fed into the computer
memory in one clock time. (The remaining six bits are needed for flags, etc.)
The computer clock rate is 1 MHz and the television camera frame rate is ap-
proximately 7 frames per sec.
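
The timing figures above can be checked with a short calculation, and the packing of two six-bit samples into one memory word can be sketched alongside it. The 18-bit word follows from the text (two six-bit samples plus six remaining bits); the bit layout chosen here, with flags in the high bits, is an assumption, as are the function names.

```python
CLOCK_HZ = 1_000_000        # computer clock rate given in the text
SAMPLE_HZ = 2 * CLOCK_HZ    # horizontal counter runs at twice the clock rate
POSITIONS_PER_LINE = 512
LINES_PER_FRAME = 512


def frame_time_s():
    """Time to step through every raster position once, ignoring any
    time spent on retrace."""
    return POSITIONS_PER_LINE * LINES_PER_FRAME / SAMPLE_HZ


def pack_word(sample_a, sample_b, flags=0):
    """Pack two six-bit samples and six flag bits into one 18-bit word.
    Layout (assumed): flags in bits 17-12, sample_a in 11-6, sample_b in 5-0."""
    for field in (sample_a, sample_b, flags):
        if not 0 <= field < 64:
            raise ValueError("each field must fit in six bits")
    return (flags << 12) | (sample_a << 6) | sample_b
```

Stepping through 512 x 512 positions at 2 MHz takes about 0.131 sec, an upper bound of roughly 7.6 frames per sec. This is consistent with the approximately 7 frames per sec quoted, presumably once time for retrace is allowed.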

The video signal is amplified and converted to digital words also at twice
computer clock rate, i.e., 2 MHz. While the sampling theorem suggested an am-
plifier bandpass of 1.0 MHz, it was possible to design it with a bandwidth greater
than 1.5 MHz and a rapid rolloff after 2.0 MHz. Sampling distortion (aliasing)
results but is moderate because the finite vidicon beam size causes the amplitude
of the components of the video signal to roll off rapidly at high frequencies. While
slightly reducing the S/N ratio of the amplifier, this extension of the video band-
pass lessens distortion of these high frequency components.
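
The aliasing mentioned here follows the standard folding rule for sampled signals: a component above half the sampling rate reappears at a lower frequency. The sketch below applies that rule to the 2 MHz sampling rate; the function name is hypothetical.

```python
def aliased_frequency(f_hz, sample_rate_hz):
    """Frequency at which a component at f_hz appears after sampling
    at sample_rate_hz (standard spectral folding)."""
    f = f_hz % sample_rate_hz
    return min(f, sample_rate_hz - f)
```

With 2 MHz sampling, a video component at 1.5 MHz passed by the widened amplifier would fold down to 0.5 MHz, while components below 1 MHz are unaffected. The distortion stays moderate only because, as the text notes, the vidicon delivers little amplitude at those high frequencies.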
3.6 Uncertainties Due to the Type 1 Camera Electronics

In the discussion that follows three terms will be used: error, uncertainty
and systematic error. An "error" is a deviation from an expected value. An
"uncertainty" is a random error. A "systematic error" is a consistent devi-
ation from an expected value (10).

Uncertainty in the centering of the electron beam is due to D-C bias drift,
which in the present design is estimated to be a maximum of one part in 1000 or
0.03 v in the bias of 30 volts. This deviation is 1/1000th of the raster, whose
useful width, as reported in subsection 3.8, is 300 lines. Thus there is a center-
ing uncertainty of 0.3 line along both the horizontal and vertical directions. This
uncertainty can be reduced to one half this amount or less by careful design.

Systematic error in deflection is a function of the amount of deflection. In
the present design this error is zero at the center of the raster, but it rises to
1/2% or 1-1/2 lines at the periphery. This error could be measured and com-
pensated for in computing range. It can furthermore be reduced by about a
factor of five through the use of high-gain feedback amplifier techniques. Due to
lack of uniform response across the surface of the vidicon there may also be
uncertainty in the indication of luminance if many levels are to be interpreted.
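
The arithmetic behind the centering and deflection figures is short enough to restate as a check. The numbers are those given in the text; the function names and default arguments are illustrative only.

```python
USEFUL_LINES = 300  # useful raster width reported in subsection 3.8


def centering_uncertainty_lines(drift_v=0.03, bias_v=30.0):
    """Bias drift as a fraction of the bias, scaled to raster lines."""
    return drift_v / bias_v * USEFUL_LINES


def deflection_error_lines(fraction=0.005, reduction=1.0):
    """Peripheral deflection error in lines; `reduction` is the factor
    by which feedback techniques are expected to shrink it."""
    return fraction * USEFUL_LINES / reduction
```

A drift of 0.03 v in 30 volts gives 0.3 line of centering uncertainty, and a 1/2% deflection error gives 1-1/2 lines at the periphery, falling to about 0.3 line if the factor-of-five improvement is realized.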

3.7 Optics of Camera-and-Stereo-Attachment Assembly C1

In this and succeeding subsections the word "assembly" is used to describe 
optics, camera case and supporting mechanisms, such as gimbals. The word 
"assembly" thus refers to the optical and mechanical component of a camera- 
computer chain, but not to its electronics. 

Camera assembly C was designed to have the simplest available stereo 
optics. 

Since a vidicon camera may employ lenses intended for 16-mm movie
cameras, it was possible to use the stereo attachment designed for a Bolex
camera shown in Fig. 3-3. The stereo attachment receives two views of a
scene, 63 mm (2-1/2 in.) apart, and forms images of the views side by side
on the face of the vidicon. In considering the performance of this assembly it
is convenient to straighten the two light paths and show the two views imaged as
though on separate cameras as in Fig. 3-4. Each lens receives light from a
rectangular pyramid in space subtending a minimum linear angle of 28°. The
volume of space contained in the intersection of the pyramids is the scene that is
viewed stereoscopically.
Fig. 3-3. Stereo camera assembly C1. Panels: b) with cover; c) diagrammed to show reflecting surfaces.
3.8 Anticipated Ranging Ability of Camera-Computer Chain C1-1


The ability to range is a function of camera tube resolution, geometry of the 
optics, uncertainties in centering and deflecting the electron beam in the vidicon, 
and uncertainty in the positioning of the optic axes. (See Table 3-2.) Camera 
tube resolution was measured to be 300 lines in a 1/2 inch frame at 10% response 
factor, as described in Appendix A. This finite resolution produces an uncertainty 
in the position of a point in the image of ± 1.0 line which, in the geometry of the 
optics, is ± 24 minutes of arc. Centering uncertainty, described in subsection 
3.6, does not contribute to range uncertainty. Since both images are on different 
portions of the face of the same vidicon, centering uncertainty cannot change the 
distance between the points. Deflection error is systematic and proportional to 
the amount of deflection of the beam. Since this error is consistent, measurable 
and can be compensated for in computing range or corrected by redesign, it is not 
listed. There is no angular uncertainty in relative mechanical positioning because 
the optic axes are fixed in the stereo attachment. 

All of the uncertainties of Table 3-2 combine to produce a total uncertainty
on the face of the vidicon which is equivalent to an uncertainty in angular position
of a corresponding point in the scene. The exact expression relating the total un-
certainty of position on the vidicon face to a range uncertainty is developed in
Appendix B.3. There it is shown that range uncertainty increases approximately
as the square of the range.
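
The square-law dependence can be seen from a small-angle sketch. This is not the exact expression of Appendix B.3; the symbols are assumptions introduced here: b is the separation of the optic axes, \theta the angular disparity between the two views of a point, R its range, and \Delta\theta the total angular uncertainty.

```latex
R \approx \frac{b}{\theta}
\quad\Longrightarrow\quad
\left|\frac{dR}{d\theta}\right| = \frac{b}{\theta^{2}} = \frac{R^{2}}{b}
\quad\Longrightarrow\quad
\Delta R \approx \frac{R^{2}}{b}\,\Delta\theta
```

For a fixed angular uncertainty, doubling the range therefore roughly quadruples the range uncertainty, while increasing the separation b reduces it in proportion.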

The indicated range falls within the curves of maximum uncertainty C-C in
Fig. 3-5. Camera-computer chain C1-1 is expected to be tested in ranging on
objects up to 3.0 meters (about 10 feet) away. At that distance
the range uncertainty is about ±1.0 meter. The range uncertainty may be reduced
by increasing the separation of the optic axes, as is done in the D Assembly.

Table 3-2. Uncertainties of a point at the center of each image that contribute to range uncertainty.*

                                              CAMERA-COMPUTER CHAIN
                                 C1-1            D1-3            E1-3 (a)†       E1-3 (b)†
                                 MIN.            MIN.            MIN.            MIN.
                                 ARC    LINES    ARC    LINES    ARC    LINES    ARC    LINES

Uncertainty due to finite        ±24    ±1.0     ±3.0   ±1.0     ±3.6   ±1.0     ±0.36  ±1.0
resolution of vidicon and
geometry of optics

Uncertainty of centering of      0      0        ±1.8   ±0.6     ±2.2   ±0.6     ±0.22  ±0.6
electron beam of the vidicon

Uncertainty in the position-     0      0        ±1.0   ±0.34    ±1.0   ±0.28    ±1.0   ±2.75
ing of the optic axes

Maximum total uncertainty        ±24    ±1.0     ±5.8   ±1.94    ±6.8   ±1.88    ±1.58  ±4.35
                                 min.   lines    min.   lines    min.   lines    min.   lines

*This table assumes systematic errors have been corrected.
†The headings of the two E columns are only partly legible in the source; (a) and (b) denote them in the order given.

Fig. 3-5. Anticipated maximum uncertainties in camera-computer chains C1-1 and D1-3.
(See Appendix B.2 and B.3 for computation of the points plotted here.) Axis label recovered: indicated range (meters).
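
The figure of about ±1.0 meter at 3.0 meters can be reproduced with a small-angle estimate, in which range uncertainty is the square of the range times the angular uncertainty divided by the separation of the optic axes. This is an approximation offered as a check, not the exact expression of Appendix B.3; the function name is hypothetical. The angular totals are those of Table 3-2, which equal the straight sums of the entries above them.

```python
import math

ARCMIN = math.pi / (180 * 60)  # one minute of arc, in radians


def range_uncertainty_m(range_m, separation_m, angle_uncertainty_arcmin):
    """Small-angle estimate: delta_R ~ R**2 * delta_theta / b."""
    return range_m ** 2 * angle_uncertainty_arcmin * ARCMIN / separation_m


# C1-1: optic axes 63 mm apart, total uncertainty 24 minutes of arc.
c1_at_3m = range_uncertainty_m(3.0, 0.063, 24.0)

# D1-3: yaw axes 6.250 in. apart; total is the sum of the three
# Table 3-2 entries (3.0 + 1.8 + 1.0 minutes of arc).
d1_total_arcmin = sum((3.0, 1.8, 1.0))
d1_at_3m = range_uncertainty_m(3.0, 6.250 * 0.0254, d1_total_arcmin)
```

Under these assumptions the C1-1 estimate at 3.0 meters comes out very close to 1 meter, in agreement with the text, and the wider separation and smaller angular uncertainty of the D assembly give a markedly smaller figure at the same range.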

3.9 Camera-and-Gimbal Assembly D1 

The D Assembly is a frame in which various combinations of transducers
and lenses may be tested. The assembly shown in Fig. 3-6 provides for rotat-
ing each camera about its yaw axis and rotating both cameras as a unit about the
pitch axis. The configuration chosen is appropriate for testing range finding
with converging optic axes. The torquer-resolver platforms (gimbals) provided
will make it possible to position each camera with an uncertainty of about 30
seconds of arc and position a pair with an uncertainty of about one minute of arc.
(Other configurations which were considered are shown in Appendix C.)

An uncertainty of one minute of arc was also sought for each vidicon-lens
assembly (camera).


Vidicon resolvable line width = 0.0017 in.

$$ \text{focal length, } f = \frac{\text{line width}}{\sin 1'} = 5.66\ \text{in.} $$

Such a lens is undesirable when considering a mechanical support of that length.

The lens selected has a focal length of 3 inches and images a 5.67° square field
on the raster of the vidicon. With this width of field and focal length the
resolution limit is determined by the finite resolution of the vidicon.

The D1 Assembly is shown in Fig. 3-6 with 3-inch focal length lenses. The
physical length of the lens is 3.9 inches and the overall length of the lens and
vidicon assembly (camera) is 13.0 inches. Each camera is turned by a torque
motor above it which measures 3 inches in diameter, while the position of the
camera is sensed by a resolver below it measuring approximately 4 inches in
diameter. The yaw axes of the cameras are separated by 6.250 inches. The
cameras are mounted in a stereo box pitched by a torque motor, which is out of
sight at the left. The pitch angle is sensed by the resolver at the right. Geometric
relations in Camera-and-Gimbal Assembly D1 are shown in Fig. 3-7.

Motion of each camera is in increments of 1°, an amount being reduced by
redesign.

3.10 Anticipated Ranging Ability of Camera-Computer Chain D1-3

Table 3-2 summarizes the uncertainties in ranging by Camera-Computer
Chains C1-1 and D1-3. For the latter it is assumed that the axes of the cameras
converge so that = a ^ (see Fig. B-l, Appendix B). 

In determining the uncertainties in range finding with two cameras, the
intrinsic uncertainties are doubled since in a worst case uncertainties in the two
cameras do not cancel each other. Thus, as tabulated in Table 3-2, uncertainty
due to finite resolution of the vidicon becomes ±1.0 line of aperture response
rather than ±0.5 line for one camera. The drifts of the zero positions of the vidicon
rasters of the two cameras are in general not in phase and result in a worst
case error of ±0.6 vidicon line. The amounts by which centering uncertainty
and deflection error are reducible by redesign are indicated in subsection 3.6.

In addition, there is the sum of the uncertainties of mechanical positioning for
the two cameras. As summarized in Table 3-2 the maximum uncertainty in the
position of a point on the vidicon raster is ±1.94 lines, corresponding to an
uncertainty in angular position of the optic axis of ±5.8 minutes of arc.
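
The maximum totals in Table 3-2 are worst-case sums: the three contributions are simply added. A minimal sketch that reproduces the totals from the tabulated entries follows; the dictionary layout and the function name are illustrative only and are not part of the report's software.

```python
# Worst-case totals of Table 3-2: the three contributions are added directly.
# Each entry is (minutes of arc, vidicon lines); the layout is illustrative.
CONTRIBUTIONS = {
    "C1-1": [(24.0, 1.0), (0.0, 0.0), (0.0, 0.0)],
    "D1-3": [(3.0, 1.0), (1.8, 0.6), (1.0, 0.34)],
    "E1 train": [(3.6, 1.0), (2.2, 0.6), (1.0, 0.28)],
    "E2 train": [(0.36, 1.0), (0.22, 0.6), (1.0, 2.75)],
}


def worst_case_total(rows):
    """Return (total minutes of arc, total lines) as a plain sum of the rows."""
    arc = sum(r[0] for r in rows)
    lines = sum(r[1] for r in rows)
    return round(arc, 2), round(lines, 2)


for chain, rows in CONTRIBUTIONS.items():
    print(chain, worst_case_total(rows))
```

For chain D1-3 the sums are 5.8 minutes of arc and 1.94 lines, the figures quoted in the text.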

This uncertainty of position in the image plane of about ±2 lines of aperture
response results in uncertainties in range as developed analytically in Appendix
B. The indicated range using Camera Assembly D as described falls within the
curves of maximum uncertainty D-D in Fig. 3-5. At a range of 3.0 meters the
maximum uncertainty is about ±0.13 meter (about ±5 inches).
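
The exact range expression is developed in Appendix B and is not reproduced here. As a rough check only, the sketch below uses the first-order small-angle stereo relation, in which range error grows as the square of the range and inversely with the separation of the optic axes. The relation, the function name, and the use of the 6.250-inch yaw-axis separation as the baseline are assumptions of this sketch.

```python
import math


def range_uncertainty(range_m, baseline_m, angle_uncertainty_arcmin):
    """First-order range error for a stereo pair: dR ~ R^2 * dtheta / b.

    Small-angle approximation (an assumption of this sketch), not the
    exact expression developed in Appendix B.
    """
    dtheta = math.radians(angle_uncertainty_arcmin / 60.0)
    return range_m ** 2 * dtheta / baseline_m


BASELINE_D1 = 6.250 * 0.0254   # yaw-axis separation of the D1 assembly, meters

for r in (1.0, 2.0, 3.0):
    print(r, round(range_uncertainty(r, BASELINE_D1, 5.8), 3))
```

At 3.0 meters the sketch gives roughly ±0.1 meter, the same order as the ±0.13 meter quoted above, and doubling the range quadruples the error.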
3.11 Camera-and-Gimbal Assembly E1


The E1 Assembly is being developed to determine the size and shape of a stereo
assembly composed of two cameras, each of which receives on one vidicon both the
narrow-angle high-resolution field shown at the bottom of Fig. 3-8 and the
wider-angle coarse-resolution field shown at the top. It is to be carried by a vehicle. Its
gimbals will level the axis of the stereo box each time the robot stops to look, at
the same time permitting the cameras to look up, down, sideways, or backwards.

In addition to providing both wide and narrow fields of view, the optical train
of camera E1 is folded so that while its longer focal length is 400 mm (16 inches) its
front-to-back length is 6-1/2 inches. This will be achieved by mounting the vidicon
at a right angle to the axis of the entrance to the optical train. (See Figs. 3-9
and 3-10.) Details of the optical train are given in Appendix D.

The wide-field optical train of the E1 Assembly will provide range finding
performance with more uncertainty than that of the D1 Assembly while the narrow-field
optical train will provide range finding performance with less uncertainty. Figure
3-11 shows the bounds of maximum uncertainty for the three optical trains, D-D
representing that of camera-computer chain D1-3, E1-E1 and E2-E2 representing
those of the wide-angle and narrow-angle trains respectively in camera-computer
chain E1. The causes of the uncertainties are tabulated in Table 3-2. Note that
the angular uncertainties due first to the finite resolution of the vidicon and
geometry of the optics and second to the centering of the electron beam are ten times
as great for the E1 train as for the E2, while the uncertainties in positioning the
optic axes are the same.

On the other hand, the number of lines of uncertainty is the same in both
optical trains for the first two causes, but ten times as great in the second
optical train for the third cause. (Lines of uncertainty are represented by s in the
equations for range uncertainty in Appendix B.2.) Doubling the aperture response,
as proposed in subsection 3.5, will halve the first two causes of uncertainty but
not affect the third.


The effect of doubling the aperture response is most striking in the C
configuration reported in the first column of Table 3-2. Accordingly, a possible
C2 stereo optics is being considered that would be an improvement over the C1

Fig. 3-8. Diagram of composite field of view at vidicon of an E1 camera.

Fig. 3-9. Camera E1. Not shown are the shutter, means of focusing the lens and
means of installing and changing filters.

Fig. 3-10. Camera-and-gimbal assembly E1. (Axes labeled in the figure: camera pitch axis, azimuth axis.)

Fig. 3-11. Maximum uncertainties in indicated range for camera-computer chains D1-3 and E1-3.
Axes: indicated range of an object (meters) against range of an object (meters).
The curves D-D are the bounds of maximum uncertainty about the actual range.
E1-E1 and E2-E2 are the corresponding bounds for the low and high resolution fields
of the E1-3 chain respectively. (See Appendix B.2 for computation of the points
plotted here.)
SECTION 4


VISUAL DATA PROCESSING

by

Roberto Moreno-Diaz and James Bever


4.1 General

This section is concerned with methods of processing visual data in such a
way that significant features can be enhanced and abstracted. In particular, it
considers processing of visual data for the generation of a "point model" of the
environment. What is meant by a point model can be expressed in a simple way
as follows: imagine television records of the scene in front of the camera that
have been divided into small areas or portions, each containing many resolution
elements. The position of each small area is given by a pair of numbers (x, y).
By means of a range finding process, a third number can be assigned to each
small area indicating how far away the part of the scene imaged there is. The
point model consists of the set of "points" (x, y, z) describing the scene.

The point model represents the scene by giving a "replica" of it without
much abstraction. The realization of an internal model in a more abstract way,
so that storage can be further reduced, is the aim of such investigations as that
reported in Section 5.

The ranging process is based on finding pairs of homologous (corresponding)
small areas, or windows, in a stereoscopic pair of views. To decide that two
windows, one in the left view and the other in the right, are homologous, the
window in one view is kept fixed while being compared to a variety of windows in
the other view, until a match is obtained. The matching process requires that
the edges appearing in both windows be well defined. In general, the initial
visual data is not, and some initial processing to enhance the edges is
necessary. For this reason, we began by considering different ways of processing the
data to enhance contrast.

A conventional technique to detect, or enhance, the edges in a picture is
that of differentiation in two directions. This method is, however, very
susceptible to noise. By considering how animal retinae process the data (see
Section 8), a more generalized method of contrast enhancement was arrived at,
that of lateral inhibition. Actually, similar procedures have been extensively
used in optical filtering techniques.(13,14,15) A natural generalization of
processing of the type indicated in subsection 8.2 has resulted in the writing of a
computer program able not only to handle contrast enhancement but also to perform
other kinds of "filtering" of the data, the word filtering having been suggested
by similar techniques in optical processing and range finding. The general
program is described in the following subsections, along with an account of the
way it was arrived at. The method of finding homologous windows in subsection
4.5 was the first attempt to solve the range finding problem.

4.2 Characteristics of the Visual Processing Program 

Visual processing must include such manipulations of data within a
two-dimensional array as those performed by the set of routines associated with a
sub-array called "windo." As illustrated by Fig. 4-1, the routines are capable
of the following:

1. The "windo" array may be varied in size from 1 X 1 to 18 X N, to
   (some multiple of 18) X N for special applications.

2. a. "Windo" can be loaded and unloaded from four directions.
      Hence, "windo" can be "moved" through a larger array;

   b. Also, "windo" can be modified selectively.

3. A variety of (a) irregular and (b) regularly-shaped regions can be
   defined within "windo" which can be processed directly or mapped
   onto an array called "windo-within".

4. Arbitrary positions within "windo" can be processed together.

5. "Windo" can function as (a) a one- or (b) two-dimensional shift
   register. In particular, one "windo" can be superimposed upon
   and shifted across another.
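
Capabilities 1 and 2a can be pictured with a short sketch: a sub-array of chosen size is cut from a larger array and stepped through it. The function names `load_windo` and `sweep`, and the row-then-column order of the sweep, are assumptions made for illustration; the report's own routines are not reproduced here.

```python
def load_windo(frame, top, left, rows, cols):
    """Copy a rows x cols sub-array ("windo") out of a larger 2-D list."""
    return [row[left:left + cols] for row in frame[top:top + rows]]


def sweep(frame, rows, cols, step=1):
    """Yield (top, left, windo) for every position of the window in the frame."""
    height, width = len(frame), len(frame[0])
    for top in range(0, height - rows + 1, step):
        for left in range(0, width - cols + 1, step):
            yield top, left, load_windo(frame, top, left, rows, cols)


# Toy 4 x 6 "picture" whose element values encode their own position.
frame = [[10 * r + c for c in range(6)] for r in range(4)]
positions = list(sweep(frame, rows=2, cols=3))
print(len(positions), positions[0][2])
```

Each yielded window is a copy, so it can be modified selectively (capability 2b) without disturbing the larger array.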

4.3 Initial Approach 

Our initial approach to processing visual data was to reduce a small array of 
data to some sort of line, representing a summary of grey level contours present 
in the array. Grey level contours, frequently termed "edges", occur as sudden 
shifts in grey level, extending typically along the boundary between a shape and a 
background of contrasting luminance. It should be noted that specks of noise also 
cause such level shifts. 
A computer program which reduces data to the line form first loads "windo"
with data, then sweeps across this data searching for abrupt shifts in grey level.
The program marks such a shift by placing a cross, consisting of five binary
ones, in an array called "windo-within." A chain of binary-coded crosses may
form a contour which will extend unbroken between two sides of "windo-within."
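
A minimal sketch of this marking step follows. The report does not state the criterion for an "abrupt shift"; here it is assumed to be a difference between horizontal neighbors that exceeds a threshold, and the name `mark_edges` is hypothetical.

```python
def mark_edges(windo, threshold):
    """Return a binary array with a five-point cross at each abrupt grey shift.

    A shift is taken to be a horizontal neighbor difference larger than
    `threshold` (an assumed criterion; the report does not give one).
    """
    rows, cols = len(windo), len(windo[0])
    within = [[0] * cols for _ in range(rows)]   # plays the role of "windo-within"
    for r in range(rows):
        for c in range(1, cols):
            if abs(windo[r][c] - windo[r][c - 1]) > threshold:
                # Center plus four neighbors, clipped at the array border.
                for dr, dc in ((0, 0), (-1, 0), (1, 0), (0, -1), (0, 1)):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols:
                        within[rr][cc] = 1
    return within


# A vertical boundary between a dark region (1) and a bright region (9).
windo = [[1, 1, 9, 9, 9] for _ in range(3)]
for row in mark_edges(windo, threshold=4):
    print(row)
```

Because neighboring crosses overlap, the marks along a boundary join into an unbroken chain from one side of the array to the other.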

A subsequent process reduces contours to straight lines, and counts them as
components of a "vector."

A further process endeavors, by means of comparing addresses of data from
which matching left and right "eye pointers"* have been computed, to establish
the range of the data.

The line reduction scheme above cannot be used to interpret all the data
configurations possible in "windo." It is especially ineffective where high
resolution increases the complexity of the data.

4.4 A Second Approach 

The line approach suggested that a form of correlation would be useful. A
method of correlation was then developed whose first step is to differentiate the
data, such that the presence of an edge is represented by a 1, the absence by 0.
Afterwards, a frame is transformed into a string of raster lines taken in
succession. Correlation consists of holding, say, the string of the left frame fixed
while passing the string of the right frame along it, one (bit) position at a time;
at each position, corresponding elements are "anded" together; thereafter, a
count of non-zero results is made and entered in a "correlation function." The
latter is a block of registers whose length is the same as the number of bits in
the longer string.
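
The and-and-count procedure can be sketched as follows. For brevity the sketch slides the right string in one direction only and represents each string as a list of 0s and 1s; both choices, and the function name, are assumptions of this illustration.

```python
def correlation_function(left, right):
    """AND-and-count correlation of two binary strings (lists of 0/1).

    Entry s counts the positions where left[i] and right[i - s] are both 1,
    that is, the right string has been slid s places along the left one.
    """
    n = max(len(left), len(right))   # one register per bit of the longer string
    result = []
    for shift in range(n):
        count = 0
        for i in range(shift, len(left)):
            j = i - shift
            if j < len(right) and left[i] & right[j]:
                count += 1
        result.append(count)
    return result


left = [0, 0, 0, 1, 0, 1, 1, 0]
right = [1, 0, 1, 1, 0, 0, 0, 0]
corr = correlation_function(left, right)
best_shift = corr.index(max(corr))   # location of the absolute maximum
print(corr, best_shift)
```

Here the edge pattern of the right string reappears three places later in the left string, so the correlation function peaks at a shift of 3.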

A "correlation function" can be searched as a list to determine locations of
peaks and other features. In the case of a simple form of range finding, the
location in the "correlation function" of an absolute maximum determines, in
principle, the range of a feature in the field scanned by the pair of cameras.

While such a correlation method has been found useful, difficulties such as
multiple peaking of the correlation function arise when the strings are formed,
since forming them destroys the two-dimensional character of the input. Also, the
presence of an absolute maximum value of the correlation function does not
guarantee similarity between two forms being compared.

Input to the correlation process has been provided by a program which first
differentiates data, then recodes as binary the result of differentiation, which
consists of edges present in the data. Because of the sensitivity of this process, the
results are often crowded with trivia, and do not characterize simply the original
data.

*See subsection 2.3.
4.5 The Present Approach


Experience obtained in modelling animal visual systems(16,17) suggested the
use of integral processing. Routines have been written called "Latin", short for
"lateral inhibition", which is a particular case of the lateral interaction of
afferent fibers along pathways leading from animal sensors.

"Latin" routines substitute, for a piece of data, a weighted average of data
over a region surrounding it. Weighting functions which are positive in
the center and negative in the surround smooth noise and yield results
consisting mainly of contours and other specifiable features. "Latin" is now being
tested in the processing of photographs and will be tried in the processing of
television images when camera-computer chains C and D go into operation.
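
One simple weighting function of this kind is the difference between the mean over a small central square and the mean over a larger surrounding square. The sketch below uses that form with 3 X 3 and 5 X 5 regions; the sizes, the function names, and the treatment of the border are assumptions and are not the weights used in the "Latin" routines.

```python
def box_mean(image, r, c, half):
    """Mean of the (2*half+1)-square region centered on (r, c)."""
    total = 0.0
    for dr in range(-half, half + 1):
        for dc in range(-half, half + 1):
            total += image[r + dr][c + dc]
    return total / (2 * half + 1) ** 2


def latin(image, center_half=1, surround_half=2):
    """Center-minus-surround weighted average; border elements are left at 0."""
    rows, cols = len(image), len(image[0])
    out = [[0.0] * cols for _ in range(rows)]
    h = surround_half
    for r in range(h, rows - h):
        for c in range(h, cols - h):
            out[r][c] = (box_mean(image, r, c, center_half)
                         - box_mean(image, r, c, surround_half))
    return out


# A vertical step edge: dark on the left, bright on the right.
image = [[0] * 4 + [10] * 4 for _ in range(7)]
row = latin(image)[3]
print([round(v, 2) for v in row])
```

The net weights are positive in the center and negative in the surround and sum to zero, so a uniform area yields zero while the step produces a negative response on its dark side and a positive response on its bright side.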

4.6 Range Finding Planned for Camera-Computer Chains C and D 

To find a pair of homologous windows in the left and right views, a window
is first fixed in, say, the left view, according to some criterion, such as that it be
the area of highest contrast or of highest luminance; then a "likely area"
is defined in the other view (see Fig. 4-2). Since this area contains as many
potential windows as horizontal resolution elements, the problem is to determine
which of these corresponds to the one fixed in the other view. The simplest way
is to compare, point by point, the fixed window to all those contained in the
likely area, generate some error function and use this as a criterion to decide
when the best match is attained. A point-by-point comparison is very
susceptible to shifts of the position of the points and to changes in the gain of the video
amplifiers. To lessen these difficulties, we may replace the value of the
output of the camera at a point by an average over a few positions around the one
being compared, or, in other words, smooth the initial data from the cameras.
This smoothing is similar to defocusing the original image.

Fig. 4-2. Finding a pair of homologous windows. (The "likely area" is marked in the figure.)
Let us represent by $f_L(i, j; k, l)$ the value of the output of the camera at the
point $(i, j)$ of the window $(k, l)$. (See left view of Fig. 4-2.) Let $f_R(i, j; m, n)$ be
the value of the camera output at the point $(i, j)$ of the window $(m, n)$ in the right
view. By smoothing, we obtain similar distributions $\phi_L(i, j; k, l)$ and $\phi_R(i, j; m, n)$.

The problem is then to find the coordinates $(m, n)$ of the window in the right view
that correspond to the window $(k, l)$ in the left view. If the cameras are
horizontal, $l = n$, and the width of the likely area equals the width of a window.
For every window $(m, l)$ in the likely area, we compute the absolute value
of the difference,

$$ \left| \phi_L(i, j; k, l) - \phi_R(i, j; m, l) \right| $$

and integrate it over the size of the window, that is, we form the "error":

$$ \varepsilon(k, l; m, l) = \frac{1}{M \times N} \sum_{i=1}^{M} \sum_{j=1}^{N} \left| \phi_L(i, j; k, l) - \phi_R(i, j; m, l) \right| $$

where $M \times N$ are the dimensions of the windows (M and N are odd; windows of
17 X 17 are used for experimentation).

Since, to start, k and l are fixed, $\varepsilon(k, l; m, l)$ is computed for all values of
m, that is, from $m = \frac{M-1}{2}$ to $m = F - \frac{M-1}{2}$, where F is the number of resolution
elements in the horizontal dimension of the parts of the frames viewing
overlapping areas. The errors so computed are stored in a list which is searched
to find the absolute minimum value of error. To the absolute minimum error
there corresponds a pair of addresses $(k, l)$, $(m, l)$ of windows in the left and
right view respectively. These windows are regarded as homologous.


For example, assume that the frames overlap horizontally in 200 resolution
elements, that the window size is 17 X 17, and that a left window has been
fixed at k = 30, l = 50. For the likely area, n = l = 50, and the list of
errors is formed by computing

$$ \varepsilon(30, 50; m, 50) = \frac{1}{17 \times 17} \sum_{i=1}^{17} \sum_{j=1}^{17} \left| \phi_L(i, j; 30, 50) - \phi_R(i, j; m, 50) \right| $$

for values of m between $m = \frac{17-1}{2} = 8$ and $m = 200 - 8 = 192$, that is, 184 values.
Assume that their absolute minimum value is $\varepsilon(30, 50; 35, 50)$. Windows in
positions (30,50) and (35,50) are considered homologous. The parallax is 35 - 30 = 5
elements which, since the geometry of the camera is known, can be converted
into distance (see Section 3).

This method of generating errors is actually equivalent to template pattern
recognition, where the template is provided by the fixed window.
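
The search for the minimum error can be sketched as below. The sketch assumes that the data have already been smoothed, that windows are addressed by the column and row of their center, and that the views are stored as lists of rows; the function names and the toy stereo pair are illustrative only.

```python
def window_error(left, right, k, m, l, size):
    """Mean absolute difference between two size x size windows.

    (k, l) and (m, l) are window centers given as (column, row); both
    windows lie on the same rows because the cameras are assumed level.
    """
    half = (size - 1) // 2
    total = 0.0
    for dj in range(-half, half + 1):
        for di in range(-half, half + 1):
            total += abs(left[l + dj][k + di] - right[l + dj][m + di])
    return total / (size * size)


def find_homologous(left, right, k, l, size):
    """Return the column m in the right view whose window best matches (k, l)."""
    half = (size - 1) // 2
    width = len(right[0])
    errors = {m: window_error(left, right, k, m, l, size)
              for m in range(half, width - half)}
    return min(errors, key=errors.get)   # absolute minimum of the error list


# Toy stereo pair: the right view is the left view displaced 5 columns.
left = [[(7 * c * c + 13 * r) % 31 for c in range(40)] for r in range(9)]
right = [[row[c - 5] if c >= 5 else 0 for c in range(40)] for row in left]
m = find_homologous(left, right, k=10, l=4, size=5)
print(m, m - 10)   # matched column and parallax in elements
```

As in the worked example, the parallax is the difference between the matched column and the fixed column, here 5 elements.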
Present studies indicate the need for further preprocessing of the digitized
video data before a point-by-point comparison is done to find range. In the
approach described here, the preprocessing is only an averaging. When a
complicated scene is viewed (for example, one containing many edges), enhancement of
the edges and reduction of their number prior to range finding is
necessary.

A variety of ways of achieving visual data reduction are being investigated
with the computer program described in subsection 4.2. It is hoped that
experimentation will lead to a family of weighting functions applicable to different types
of scenes. This study will lead to specification of the characteristics of the
computer in the camera-computer chain.
SECTION 5


TOWARD A METHOD OF MODELLING AN ENVIRONMENT 

by 

Louis Sutro 


5.1 General 

Since identification of life and geological forms is ultimately performed on 
Earth, the task of a robot in a planetary mission is to acquire pictorial data and 
reduce it both as a basis for decision there and for transmission to Earth. The 
reduced data must be reconstructed on Earth into a recognizable display. 

To reduce the pictorial data we are here concerned first with the creation 
of a "point model" of the scene, then the determination of properties that are 
intrinsic, that is, invariant, say, to changes in illumination. Such a property is 
the reflectance of each triangular area formed by a group of three neighboring 
points. The point model would enable a robot to select a path to follow and, trans- 
mitted to Earth, provide a three-dimensional map of the terrain. The point-and- 
reflectance model, by outlining areas of different reflectance, should permit 
discriminating surface patterns. 

The point model is considered in subsections 5.2 through 5.6, the point-and- 
reflectance model in subsections 5.7 through 5.10. The final subsection 5.11 con- 
siders the Earth station needed to display these models. 

5.2 Proposed Method of Acquiring a Point Model 

A necessary first step is to aim the cameras at an object of approximately 
known position, such as a flashing cross on the forefinger of one hand. The fore- 
finger will then be moved until it touches an edge or other feature that can be 
identified. The feature will have to be found in two views by the method of 
subsection 4.6 since the cameras cannot be aimed at the point of the finger with 
great accuracy. 

A way of finding a second point, described in subsection 2.3, is to move the 
windows along the edge used to identify the first point. Another way is to place 
the left window over any area with a sharp discontinuity in luminance and then 
search for the homologous point with the right window, while keeping the first 
point where it can be recovered. 
When three points have been acquired, they can be stored in the computer
memory as points of a triangle. Using two of these points as the base of a second 
triangle, a new third point can be acquired, and so on. As the number of points 
grows, the uncertainty of position also grows until it becomes desirable to aim 
again at a known point. 

5.3 An Example 

Assume that the robot has landed on Mars and has transmitted to Earth both
a panoramic view of its environment and its approximate position as determined
by two observations of the sun. The Earth operator has commanded the robot to
proceed along azimuth 43° for 50 meters, if feasible, to map and describe the
terrain as it proceeds, and to transmit this description together with sample
television pictures.

Looking along azimuth 43°, the robot views a scene such as that in Fig. 2-3. It needs
to form a point model and from that determine a route to follow.

Let us assume that points are then determined on the surface of the three
nearest mounds and on the ground among them. Figure 5-1 shows points on the
surface of the mound at the left center of Fig. 2-3, drawn by the same artist
who made that illustration. He, of course, knew where to place the points. Where
the choice of points is made by the computer it is better to obtain more points
so that the connections between them, if made, are not as significant. For
example, triangle GEF is a shelf whose appearance is destroyed by connecting
points E and J. For a point model, connections between points do not need to be
made, as explained in subsection 2.3. But for a point-and-reflectance model such
lines serve as bounds of the planes for which reflectance is determined.

Fig. 5-1. Triangularized view of the mound in Fig. 2-3.

Table 5-1 (meters)

Point                       X        Y        Z

F                          0.14     3.45    -0.09
G                         -0.46     2.74    -0.25
J                         -0.14     4.64     0.28
Centroid of triangle FGJ  -0.15     3.94    -0.02


The coordinates presented in Table 5-1 are computed from the data of
Appendix E.1. They are located with an uncertainty given in the lower right corner
of Table B.1 of Appendix B.

5.4 Interrogation of a Point Model 

To find a passage between mounds, the computer can form what a surveyor
calls "profiles." A transverse profile will be defined as one in a plane
perpendicular to the horizontal line of sight into the scene, which will be the robot's Y axis
if the robot is level. Figure 5-2 shows such a profile drawn on the imaginative
illustration of Fig. 2-3. This profile does not have to be drawn; it can be stored
as a set of points.

An algorithm to determine a profile can be developed as follows. A point p 
in the profile is p(x,y,z) where y is constant. To find a point p on the profile for 
each value of x, a volume of the model such as that shown in Fig. 5-3 has to be 
searched, a task which may be facilitated by storage of the model in the form of 
a list indicating the adjoining point in each direction. Such a list can be searched 
for the highest point in each elementary volume shown in Fig. 5-3. 
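
A minimal sketch of such a profile search is given below. It scans every point of the model directly rather than following a linked list of adjoining points, and it takes the elementary volumes to be bins of width dx along x inside a slab of thickness dy about the profile plane; both simplifications, and the function name, are assumptions of this sketch.

```python
def transverse_profile(points, y0, dy, x_min, x_max, dx):
    """Highest z in each elementary volume of the slab |y - y0| <= dy / 2.

    `points` is a list of (x, y, z); the result maps the left edge of each
    x-bin to the highest z found there (bins with no points are omitted).
    """
    profile = {}
    for x, y, z in points:
        if abs(y - y0) > dy / 2 or not (x_min <= x < x_max):
            continue
        bin_index = int((x - x_min) // dx)
        left_edge = x_min + bin_index * dx
        if left_edge not in profile or z > profile[left_edge]:
            profile[left_edge] = z
    return dict(sorted(profile.items()))


points = [(-0.9, 3.0, 0.1), (-0.7, 3.1, 0.3), (0.2, 3.0, 0.0), (0.3, 5.0, 0.9)]
print(transverse_profile(points, y0=3.0, dy=0.5, x_min=-1.0, x_max=1.0, dx=0.5))
```

The last point lies outside the slab about y = 3.0 and is ignored; of the two points sharing the first bin, only the higher one is kept.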

When the profile of Fig. 5-2 has been formed, it can be searched to
determine whether it contains an expanse wide and level enough to permit the robot to
cross. When such an expanse has been found, more transverse profiles can be
"drawn" nearer and further away. The apertures found in each case can then be
connected by longitudinal profiles, and these can be examined to determine if
their pitch permits the robot to move ahead.
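
The search of a stored profile for a passable expanse might be sketched as follows. The profile is assumed to be a list of heights at equal spacing dx, and "wide and level enough" is assumed to mean a run of at least `min_width` in which neighboring heights differ by no more than `max_step`; the thresholds and the function name are hypothetical.

```python
def find_passages(heights, dx, min_width, max_step):
    """Return (start_index, end_index) runs of a profile the robot could cross.

    A run qualifies when neighboring heights differ by at most `max_step`
    and the run spans at least `min_width` (both thresholds are assumed).
    """
    passages = []
    start = 0
    for i in range(1, len(heights) + 1):
        broken = i == len(heights) or abs(heights[i] - heights[i - 1]) > max_step
        if broken:
            if (i - 1 - start) * dx >= min_width:
                passages.append((start, i - 1))
            start = i
    return passages


# Heights in meters at 0.25 m spacing: level ground, a mound, level ground.
heights = [0.0, 0.02, 0.01, 0.6, 0.9, 0.05, 0.04, 0.03, 0.05, 0.04]
print(find_passages(heights, dx=0.25, min_width=0.75, max_step=0.1))
```

In this example the level stretch to the left of the mound is only 0.5 meter wide and is rejected, while the stretch to its right spans 1.0 meter and is reported.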
Fig. 5-2. Supposed view of the surface of Mars with a profile drawn in.

Fig. 5-3. Volume surrounding point p in the plane of a profile.


Another form of interrogation that can be performed on the model is general- 
izing from it both to locate the peaks, bases and approximate curvature of the 
mound as well as to conclude that except for the mounds all of the surface shown 
in Fig. 2-3 is an approximate plane. 

5.5 Interrogation of a Point Model in the Search for Evidence of Life

A profile may be "drawn" of a free-standing object, obtained in a manner
similar to the making of a transverse profile of the scene. The profile can then
be checked against a stored criterion such as: Is the object larger above than at
its base? This is a characteristic of much of plant and animal life on Earth.

5.6 Mapping 

Mapping of the surface of Mars can be carried out by the methods of a surveyor. 
From a starting station he runs a traverse of straight line segments, measuring the 
length of each and the angle between the forward and backward segments. The inter- 
section of each pair of segments is called a "station" and is marked on the ground. 

A Mars station can be defined as a place where the robot stops to look, and 
the distance between stations may be determined by an odometer. Enough stations 
should be marked to permit the robot to retrace its path. A suggested form of 
marking is a transponder which can be excited by a radio on the robot and homed 
in on. 
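
The surveyor's traverse reduces to dead reckoning: each station is found from the previous one by the measured length and the accumulated direction. In the sketch below the angle at a station is assumed to be measured clockwise from the back segment to the forward segment, azimuth is measured clockwise from north, and x and y point east and north; these conventions and the function name are assumptions.

```python
import math


def run_traverse(start, first_azimuth_deg, legs):
    """Dead-reckon station coordinates from a surveyor's traverse.

    `legs` is a list of (angle_deg, length): the angle at the station,
    measured clockwise from the back segment to the forward segment
    (an assumed convention), then the length of the forward segment.
    The first leg's angle is ignored; it leaves along first_azimuth_deg.
    """
    x, y = start
    azimuth = first_azimuth_deg
    stations = [(x, y)]
    for index, (angle, length) in enumerate(legs):
        if index > 0:
            azimuth = (azimuth + 180.0 + angle) % 360.0
        x += length * math.sin(math.radians(azimuth))
        y += length * math.cos(math.radians(azimuth))
        stations.append((round(x, 3), round(y, 3)))
    return stations


# Ten meters north, straight on for five more, then a right-angle turn east.
print(run_traverse((0.0, 0.0), 0.0, [(None, 10.0), (180.0, 5.0), (270.0, 4.0)]))
```

Odometer lengths would supply the second element of each leg; errors accumulate from station to station, which is one reason for marking stations that the robot can return to.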
5.7 Representation by Points and Reflectances


A next step in refining the model of the environment is to record a property
of the surface, particularly one that is invariant under changes in illumination.
Such a property is reflectance.

The minimum area for which reflectance needs to be determined is a triangle 
formed by three neighboring points. The amount of computation required to de- 
termine the reflectance will, of course, be weighed against the data reduction 
achieved. The principles to be employed follow. 

5.8 Computation of Reflectance

A television camera detects a mosaic of values of what used to be called 
brightness but is now called luminance, L. The term brightness is now limited 
to subjective response while luminance is used where the response can be mea- 
sured. The unit of luminance in the United States is the footlambert. Except 
when it is the property of a light source, luminance is the product of illumination 
onto a surface and the intrinsic property of that surface called reflectance. Let 
us consider each of these factors separately, then together. 

Illumination is a vector whose unit in the United States is the footcandle, whose
representation is $E$ (for éclairage) and whose component normal to a surface is
$E_N$. The latter two measures are related by the Lambert cosine law:

$$ E_N = E \cos\theta_i \tag{5-1} $$

where $\theta_i$ is the angle of incidence (see Fig. 5-4).

Fig. 5-4. a. The Lambert cosine law showing that light flux striking a surface
             at angles other than normal is distributed over a greater area.
          b. Representation of illumination $E$ as a vector and its normal
             component as a smaller vector of magnitude $E \cos\theta_i$.
Three forms of reflectance are identified: diffuse, specular, and spread.
(See Fig. 5-5.) Diffuse reflectance $\rho_D$ is that of a surface such as blotting paper; that is, it
yields the same luminance from whatever angle it is viewed. Specular
reflectance $\rho_{SR}$ is that of a polished surface; that is, it reflects light at an angle from
the normal equal to the angle of incidence. Spread reflectance $\rho_{SD}$ is that of a
roughly polished surface, reflecting so that the angle of reflectance only roughly
equals the angle of incidence. Other forms of reflectance are combinations
of these.

Fig. 5-5. The type of reflection varies with different surfaces:
          (a) matte surface (diffuse or Lambert);
          (b) polished surface (specular);
          (c) rough surface (spread).


The product of normally incident illumination onto a surface, expressed
in footcandles, and the diffuse reflectance of the surface $\rho_D$ is luminance
expressed in footlamberts,

$$ \rho_D E_N = L \tag{5-2} $$

Stated another way, the diffuse reflectance of the surface is the ratio of the
luminance from it to the illumination upon it. Figure 5-6 shows the use of this
equation in determining the reflectance of a diffuse or non-glossy surface such
as a painted wall. In Fig. 5-6(a), a General Electric type 213 light meter has
been placed with its photosensitive surface against the wall and then drawn back
two or three inches until no shadow falls on the wall. A typical reading might
be 40 fL of luminance. In Fig. 5-6(b) the meter is held with its base against
the wall. A typical reading in this position might be 65 fc of illumination.
Substituting the two measurements into Eq. (5-2) yields

$$ \rho_D = \frac{40\ \text{fL}}{65\ \text{fc}} \approx 60\% $$

Fig. 5-6. Reflectance measurement with G.E. Type 213 light meter.
          (a) Cell 2 to 3 inches from wall (no shadow).

This is an approximate determination because the areas measured are large
and the accuracy of this instrument is not high.*

A television camera can be used on Mars to determine diffuse reflectance
in a similar manner. Aimed at the surface in question, the camera measures
luminance. Aimed at the sun with a suitable protecting filter, it can measure
illumination $E$. Multiplying $E$ by the cosine of the angle of incidence $\theta_i$ yields the
normally incident illumination $E_N$.
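
Equations (5-1) and (5-2) combine into a single division, sketched below. The function name and argument order are illustrative; units are footlamberts and footcandles as in the text, and the second call uses made-up readings for an obliquely lit surface.

```python
import math


def diffuse_reflectance(luminance_fl, illumination_fc, incidence_deg=0.0):
    """rho_D = L / E_N with E_N = E cos(theta_i), per Eqs. (5-1) and (5-2)."""
    normal_illumination = illumination_fc * math.cos(math.radians(incidence_deg))
    if normal_illumination <= 0.0:
        raise ValueError("surface is not illuminated from this direction")
    return luminance_fl / normal_illumination


# Wall example from the text: 40 fL of luminance under 65 fc of illumination.
print(round(diffuse_reflectance(40.0, 65.0), 3))

# Hypothetical readings: half the luminance under light arriving at 60 degrees.
print(round(diffuse_reflectance(20.0, 65.0, 60.0), 3))
```

Both calls return the same reflectance, which illustrates why reflectance, unlike luminance, is a property of the surface rather than of the lighting.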

Figure 5-7 diagrams the measurement of diffuse reflectance by a television
camera. The energy represented by the vectors $E_N$ is reflected in an amount
determined by the value of $\rho_D$ and in a manner represented by the spherical
clusters of vectors. The magnitude of each vector in a cluster bears a cosine,
or Lambert, relation to the maximum vector in the cluster. It can be seen that
if the camera is aimed at a steep angle it detects few long vectors, at a shallow
angle many short vectors. The luminance detected in both cases is the same.



Fig. 5-7. Conversion of illumination to luminance when the surface has reflectance
$\rho_D$, is large, and is uniformly illuminated.


*An instrument that measures luminance in a narrow angle of view at high 
sensitivity is the Spectra Brightness Photometer. 
To measure specular and spread reflectance requires the ability to move
around an object and measure vectors whose magnitude is different in different
directions, as in Fig. 5-5. When specular and spread reflectance are present,
luminance is determined by the polynomial:

$$L = \rho_D E_N + \rho_{SR} E + \rho_{SD} E \qquad (5\text{-}3)$$

where

$\rho_{SR}$ = specular reflectance
$\rho_{SD}$ = spread reflectance

Note that the illumination E from the light source to the reflecting surface is used
in the second and third terms. To measure specular reflectance, it is necessary
to move the camera to a point such that the light source, the measured surface,
and the camera are in the same plane, with the angle of incidence equal to the angle
of reflection. To measure spread reflectance, the same conditions must be met and,
in addition, the angle of reflection must be varied about the value equal to the
angle of incidence to measure the "spread" of the light.

In most illuminating engineering work, luminance is computed from only the
first term in the above equation. There may, however, be three such terms, one
for each light primary. These are referred to as "tri-chromatic coefficients."
Color measurement, which this implies, will be considered later.

The point model can be acquired under any illumination sufficient for edges,
differences in shape, or differences in reflectance to be detected as contrast.

The point-and-reflectance model, on the other hand, requires measurement of
the illumination. To acquire the second kind of model, therefore, the robot must
first determine the illumination upon the surface or, better yet, the illumination
upon the scene.

From its observations of the sun at two times in one day, the robot can predict,
for any other time, the azimuth and declination of the sun and the resulting
illumination normal to any surface of known position. Because there is almost
no atmosphere on Mars, it is expected that there will be almost no sky light,
except during dust storms, when the lenses of the cameras will probably have
to be capped to protect them from injury. The only sources of light are expected
to be the sun and light reflected from the surface of Mars. In the example that
follows, only sunlight is considered.

The method presented here is similar to that of a painter who sketches a 
scene so that he can return to his studio to paint it. He observes the direction 
of the sunlight and, if he is acquainted with the effect of sunlight, keeps it in the 
back of his mind as he concentrates on shapes and their reflecting properties.
In doing this he is recording a scene invariant to illumination. Returning to his
studio, he can re-introduce sunlight as he paints the highlights, shades and
shadows.

No painter that we know of is aware of the mathematical methods underlying this task.
Such methods are common in illuminating engineering, where, to achieve a desired
appearance of a room, the bulbs, wattages and reflectors, as well as the reflectances
of walls, ceiling and floors, are considered.

5.9 Computation of the Diffuse Reflectance of ΔFGJ

The example of subsection 5.3 will now be extended by determining

1. The direction components of the normal to triangle FGJ, computed
in Appendix E.2,

2. The average luminance L of this triangle, measured by the left
camera, as 1300 footlamberts,

3. The illumination E from the sun, measured also by the left camera,
as 4000 footcandles,

4. The direction components of sunlight, determined from the position
of the sun stored in the computer memory,

5. The cosine of the angle $\theta_i$ between the direction of sunlight and the normal
to ΔFGJ, computed in Appendix E.3 to be $\cos\theta_i = 0.71$,

6. The illumination normal to the triangle, employing Eq. (5-1),

$$E_N = 4000 \times 0.71 = 2840\ \text{footcandles}$$

7. The average diffuse reflectance of ΔFGJ, employing Eq. (5-2),

$$\rho_D = \frac{1300}{2840} = 0.46$$


Measurement of the luminance of the remaining triangles, together with con- 
tinued application of the above equations, will yield the average diffuse reflectance 
of every surface of the model. 
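
The seven steps above can be sketched in a few lines of Python. This is only an illustration, not the program used in the report: the function names are hypothetical, and the cosine 0.71 is taken as given from Appendix E.3 rather than recomputed, since the direction components themselves are not listed here.

```python
import math


def cos_between(u, v):
    """Cosine of the angle between two direction vectors (steps 1, 4 and 5)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def normal_illumination(E, cos_incidence):
    """Eq. (5-1): illumination normal to the surface, in footcandles."""
    return E * cos_incidence


def diffuse_reflectance(L, E_N):
    """Eq. (5-2): diffuse reflectance from luminance (fL) and normal illumination (fc)."""
    return L / E_N


# Values quoted in the text for triangle FGJ.
E_N = normal_illumination(4000.0, 0.71)
rho_D = diffuse_reflectance(1300.0, E_N)
print(round(E_N), round(rho_D, 2))
```

Repeating the last two calls for each triangle, with its own measured luminance and its own cosine of incidence, would give the reflectance of every surface of the model.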

5.10 Interrogation of a Point-and-Reflectance Model 

A point-and-reflectance model can be interrogated for properties that cannot
be extracted from the point model. For example, in a point-and-reflectance
model of a scene containing dark rock and light sand, the computer could determine
their boundary and report it to Earth; or, on command from Earth, the
robot could explore one area and ignore the other.
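
One way such an interrogation might be carried out is sketched below. It is an assumption-laden illustration, not the report's method: triangles are classed as "light" or "dark" by a hypothetical reflectance threshold, and the boundary is taken to be every edge shared by one triangle of each class. The vertex labels K and M, the threshold, and all reflectances other than the 0.46 computed above are invented for the example.

```python
from collections import defaultdict


def boundary_edges(triangles, reflectances, threshold):
    """Return the edges shared by a light triangle and a dark triangle."""
    classes_on_edge = defaultdict(set)
    for (a, b, c), rho in zip(triangles, reflectances):
        is_light = rho >= threshold
        for edge in ((a, b), (b, c), (c, a)):
            classes_on_edge[tuple(sorted(edge))].add(is_light)
    # An edge lies on the boundary when both classes meet along it.
    return sorted(e for e, classes in classes_on_edge.items() if len(classes) == 2)


# Hypothetical model: two light (sand) triangles and one dark (rock) triangle.
triangles = [("F", "G", "J"), ("G", "J", "K"), ("J", "K", "M")]
reflectances = [0.46, 0.44, 0.12]
edges = boundary_edges(triangles, reflectances, threshold=0.30)
print(edges)
```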
5.11 The Earth Station


Two ways of transmitting pictorial data to Earth are being planned. One is
transmission of television pictures, either monocular or stereoscopic; the other is
transmission of the model of the surface of Mars employed by the robot in making
decisions. The model can be presented on the same screens (Fig. 5-8) as the
television pictures, provided a computer is employed on Earth to store it and
arrange it for display. The arranging process can be such as to give the viewer
the impression of moving through the scene.

If the model is composed of points and reflectances, then the Earth operator
can command the illumination under which he views it. Figure 5-9 illustrates a possible
display whose luminance to the eye of the operator is computed from the
reflectances of the surfaces and from illumination that he has commanded to come
from the upper right of the scene.
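
A minimal sketch of how an Earth computer might shade one triangle of the model follows. It assumes purely diffuse surfaces, so that only the first term of Eq. (5-3) is used, and it sets the luminance to zero for a surface facing away from the commanded light. The function names and the example vectors are hypothetical.

```python
import math


def unit(v):
    """Scale a direction vector to unit length."""
    n = math.sqrt(sum(c * c for c in v))
    return tuple(c / n for c in v)


def display_luminance(rho_D, normal, to_light, E):
    """Luminance L = rho_D * E * cos(theta_i) of a diffuse surface.

    `normal` is the outward normal of the triangle and `to_light` points
    from the surface toward the commanded light source.
    """
    n, s = unit(normal), unit(to_light)
    cos_i = sum(a * b for a, b in zip(n, s))
    return rho_D * E * max(0.0, cos_i)


# A horizontal triangle lit from 45 degrees above the horizon.
lit = display_luminance(0.46, (0, 0, 1), (1, 0, 1), 4000.0)
# The same triangle with the light below the surface is in shadow.
dark = display_luminance(0.46, (0, 0, 1), (1, 0, -1), 4000.0)
print(round(lit), dark)
```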



Fig. 5-8. Stereo picture presentation using two 23-inch monitors and Polaroid filters. 
Orientation of each filter in glasses corresponds to orientation of filter 
before each monitor. 



Fig. 5-9. Triangularized view of the mound in Fig. 2-3 shaded
as though by an Earth computer.
SECTION 6


ADDITION OF A LEARNING ABILITY 
TO THE DECISION-MAKING SUBSYSTEM 

by 

William L. Kilmer, Jay Blum and Warren McCulloch 


6.1 Introduction

In a temporally changing environment, a static decision-making computer
such as the model described in Refs. 3 and 4 may be at a significant disadvantage.
Consequently, that model, called S-RETIC (where S denotes a static
environment), is being extended into a model of a decision
computer capable of several forms of conditioning and learning. The new model
is called STL-RETIC, where T denotes a temporally changing environment and L
a conditioning and learning ability of the kind described in Section 9.

As simulated on the Instrumentation Laboratory general-purpose computer,
the static S-RETIC model consists of twelve modules ($M_i$), inputs (y's) to the
modules from the simulated environment (Σ), and outputs (P) to threshold elements.
The decision computer commands a mode of activity only when a
threshold has been exceeded.

To make learning possible, the model is being modified to include logic
and memory elements reinforceable as a function of experience. The logic
elements will operate in the A-part of each module, while memory elements
will store past inputs from the environment, messages from other A-parts, and
past A-part computations. Figure 6-1 shows each y input to an $M_i$ paralleled
by a memory element and each connection (w) from another module similarly
paralleled. The number of y inputs from each $M_i$ has been increased from five
to seven. Note that the design continues to be one that can be built of hardware
as well as simulated.

6.2 Requirements

The augmented STL-RETIC structure ought to satisfy the following re- 
quirements: 
First, if the environment is rich in stimuli and if the responses which follow
these stimuli are at least partially unknown, it is not possible to establish an 
input- output correspondence table defining for the decision computer which acts 
should follow which stimuli. The computer must be trained. 

Second, the decision mechanism should respond differently to different 
overall environments; it should be able to adjust to a new input-output specifi- 
cation, so that it need not be reprogrammed. 

Third, if one or more of the robot's sensory or action subsystems fails,
the decision computer should compensate by changing its responses to input
stimuli.

6.3 Nature of the Problem 

A difficulty in augmenting S-RETIC to be able to learn is that each module
appreciates only part of the input-output correspondence. How can all of the
modules function harmoniously in the same direction, so that the effect is cooperative
and not chaotic? Generally, each module is most sensitive to different
input stimuli; for example, module 5 may discriminate among overall
Σ input sets 67, 68, and 69, whereas module 4 may discriminate among input sets
23, 24, and 25. Moreover, neither may discriminate at all among other input
sets. If one module makes necessary distinctions between two cases which call
for different responses, it has to be able to notify neighboring modules of these
distinctions as they occur. This would enable the other modules to act appropriately.

6.4 Approach to the Problem 

The problem mentioned in subsection 6.3 may be limited by requiring that 
the decision-making computer be effective only over several hundred successive 
stimuli. Each call for a change in response would be stored in memory elements 
so that, when several hundred had been accumulated, actual alteration of the 
response would occur. Naturally, the smaller the number of stimuli required, 
the better the model will be able to yield up-to-date responses. Thus, the bene- 
fit of reduced complexity achieved through storage of desired responses must be 
weighed against the delays intrinsic to such storage. 
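
The trade-off described above can be illustrated with a small sketch. It is not the STL-RETIC mechanism: the class name, the dictionary used as a response table, and the rule that a later call overrides an earlier one for the same stimulus are all assumptions made for the example. Calls for change are merely stored until a chosen number has accumulated, and only then does the response actually alter.

```python
class DeferredResponseTable:
    """Stores calls for a change in response and applies them in batches."""

    def __init__(self, batch_size):
        self.batch_size = batch_size
        self.responses = {}   # stimulus -> response currently in force
        self.pending = []     # stored calls for change: (stimulus, response)

    def respond(self, stimulus, default=None):
        return self.responses.get(stimulus, default)

    def call_for_change(self, stimulus, response):
        self.pending.append((stimulus, response))
        if len(self.pending) >= self.batch_size:
            self._apply()

    def _apply(self):
        # Assumption: a later call overrides an earlier call for the same stimulus.
        for stimulus, response in self.pending:
            self.responses[stimulus] = response
        self.pending.clear()


table = DeferredResponseTable(batch_size=3)   # "several hundred" in the text
table.call_for_change("obstacle", "retreat")
table.call_for_change("obstacle", "turn")
before = table.respond("obstacle", default="advance")
table.call_for_change("slope", "slow")        # third call triggers the alteration
after = table.respond("obstacle", default="advance")
print(before, after)
```

A smaller `batch_size` brings responses up to date sooner, at the cost of altering the table more often.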

6.5 Implementation by Operations on Vectors 

Each kind of learning will be implemented by operations on the vector
which goes from the A part to the B part of each module. These operations will
be similar to those described in subsection 5.6 of Ref. 4: flatten vectors,
elevate vectors on certain modes, depress them on certain modes, measure the
peakedness of vectors in order to decide what to do with them, and permit one
vector to change into another systematically, smoothly, and gradually.
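
The operations listed above are defined in Ref. 4, which is not reproduced here. The sketch below is therefore only one plausible reading: it assumes the vector is a list of non-negative weights over modes that sums to one, and every function definition (mixing with a uniform vector to flatten, renormalizing after elevating or depressing, taking the largest component as peakedness, linear interpolation for gradual change) is an assumption made for illustration.

```python
def normalize(v):
    total = sum(v)
    return [x / total for x in v]


def flatten(p, k):
    """Move p a fraction k of the way toward the uniform vector."""
    n = len(p)
    return [(1 - k) * x + k / n for x in p]


def elevate(p, mode, amount):
    """Raise the weight of one mode, then renormalize."""
    q = list(p)
    q[mode] += amount
    return normalize(q)


def depress(p, mode, factor):
    """Scale down the weight of one mode (0 <= factor < 1), then renormalize."""
    q = list(p)
    q[mode] *= factor
    return normalize(q)


def peakedness(p):
    """Largest component; 1/n for a flat vector, 1 for a fully peaked one."""
    return max(p)


def blend(p, q, step):
    """Change p gradually into q; step = 0 gives p, step = 1 gives q."""
    return [(1 - step) * a + step * b for a, b in zip(p, q)]


p = [0.7, 0.2, 0.1]
print(peakedness(p), peakedness(flatten(p, 0.5)))
print(blend(p, [0.1, 0.2, 0.7], 0.5))
```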
[Fig. 6-1 not reproduced; surviving label: connections to higher modules]

6.6 Structure of the Model


Figure 6-2 is a schematic representation of the relation between the three
timing parameters used in STL-RETIC. At the present time t, at which a stimulus
is applied, the model looks back to the time of the application of the last
stimulus, $\Sigma_1$; the time difference is then T. If the model converged after application
of $\Sigma_1$, the time to convergence from t-T is called $\tau$, with rpc(t), in
this case, the reinforcement due to $\Sigma_1$ and rac(t) the reinforcement due to convergence.
If no convergence occurs, reinforcement over the entire period T
is that due to $\Sigma_1$, namely rpc(t). Note that it is not until the application of
$\Sigma_2$ that reinforcement due to convergence may occur and that consequently the
reinforcement signals rpc and rac are written as rpc(t) and rac(t).

A condensed flow chart of the stimulus-convergence-reinforcement process
is given in Fig. 6-3. After getting the first stimulus Σ, computations based on
receipt of the stimulus are carried out, after which the model checks for convergence.
If convergence has not occurred, present time is incremented one
unit and the model looks for a new input stimulus Σ. Otherwise, if convergence
has occurred, the model checks for a new input without augmenting time. If, in
either case, a new input has not been received, the parameters used to determine
convergence are recomputed. Note that these change in value from what they
were previously only if time has been incremented. Thus, if convergence occurs,
the model remains converged until receipt of another stimulus (it stays in the
left-hand loop).


[Fig. 6-2 labels: t - T; present time t; T = period between changes in Σ; τ = period of convergence]


Fig. 6-2. Reinforcement parameters at the time t when Σ is applied.
If a new input has been received, reinforcement occurs, regardless of
whether or not convergence has been attained. 

Finally, the model recomputes the effects of the new input on the conver- 
gence parameters, and the entire process is repeated, coming to a halt only 
when a "STOP SYSTEM" command is given to it. 
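
The control flow just described can be sketched as a short loop. This is a simplified reading of the prose, not a transcription of Fig. 6-3: the convergence test is replaced by a hypothetical rule (the model is taken to have converged once a fixed number of time units has passed since the last stimulus), and each pass through the loop polls a prepared list that yields a new stimulus, `None` for no new input, or `"STOP"`. The function name and the form of the reinforcement record are assumptions.

```python
def run_stl_loop(polls, steps_to_converge):
    """Return one (stimulus, T, tau) record per reinforcement.

    T is the period since the previous stimulus; tau is the period of
    convergence, or None when the model had not converged, in which case
    only the reinforcement due to the stimulus itself (rpc) applies.
    """
    polls = iter(polls)
    t = 0
    stimulus = next(polls)        # get the first stimulus
    t_applied = t
    reinforcements = []
    while True:
        # Stand-in for the convergence computation (assumption).
        converged = (t - t_applied) >= steps_to_converge
        if not converged:
            t += 1                # time advances only while unconverged
        new = next(polls, "STOP")
        if new == "STOP":
            break
        if new is None:
            continue              # no new input: recompute convergence parameters
        # A new input has arrived: reinforcement occurs in either case.
        T = t - t_applied
        tau = steps_to_converge if converged else None
        reinforcements.append((stimulus, T, tau))
        stimulus, t_applied = new, t
    return reinforcements


log = run_stl_loop(["A", None, None, None, None, "B", None, "C", "STOP"], 3)
print(log)
```

In this run stimulus A is followed by convergence before B arrives, while B is replaced by C before the model has converged.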

Note that the dotted lines in the upper middle of Fig. 6-3 represent the old 
S-RETIC model. Thus, it can be seen not only that STL-RETIC performs the 
same tasks of receiving stimuli and converging upon a mode of activity as were 
done with S-RETIC, but that the augmented model is capable of reinforcement 
and its consequent output effects. 
PART II


RESEARCH INTO THE NATURE OF ANIMAL SENSORY, 
DECISION AND CONTROL SYSTEMS 



SECTION 7 


ANIMAL COMPUTATION 

by 

Louis Sutro and David Lamport 
with the technical assistance of Warren McCulloch 

7.1 Background to a Computer Model of a Vertebrate Brain

From his life-long study of the human nervous system, Dr. Warren
McCulloch has concluded that the essential properties of human computation 
must serve as a basis for the electronic acquisition and storage of human 
knowledge. Although as a neurologist, psychologist, and physiologist he is 
aware of the dangers involved in embodying mental functions in physical devices, 
he has nevertheless developed a simplified model of a vertebrate brain. His in- 
tention is merely to suggest an organizational hierarchy necessary for efficient 
robot performance. 

Figure 7-1 outlines his model of the vertebrate nervous system, identifying
what he feels are five principal computational areas and their functional connections.
At the left is the retina, consisting of three layers of cells, two of which
are engaged in computation. The eye is shown as representative of the senses 
because its computational capacity qualifies it as a principal computer; it is the 
foremost data source to the primate brain, providing two million of its three mil- 
lion inputs. 

At the upper left is the cerebrum, which Dr. McCulloch calls the "great
computer" and in which computation is carried out in many layers, each of
which, if unfolded in man, would be about the size of a large newspaper.

The computer which controls all others is shown at the center right. It is 
the reticular core of the central nervous system and extends from the base of 
the brain through the spinal cord. It makes the major decisions, or rather, 
shifts the focus of attention so as to determine what is to be done from moment 
to moment. By deciding which command function is to take over, it thus con- 
trols all other computers and, through them, the whole organism. A model of 
this computer was presented in Refs. 3 and 4 and further developed in Section 6 
of this report. 
[Fig. 7-1 not reproduced; surviving label: transducers, either muscles or glands]

Clusters of nerve cells at the base of the cerebrum comprise the basal
ganglia, a computer shown at the lower left of the figure. Here are programmed
all innate or learned movements, such as feeding, walking or throwing a ball. 
Additional programs may be acquired through the growth of connections to the 
motor control nerve cells, shown along the bottom of the illustration. 

Completing the list of principal computational areas is the cerebellum, shown
at the top of Fig. 7-1. It programs the completion of a movement, such as reach- 
ing to touch an object, and requires inputs both from the vestibular system, to 
detect tilt and acceleration of the head, and from skin and muscle sense cells, to 
detect posture and information as to what is being touched. 

Interconnected with the principal computers are switching structures, such 
as the thalamus, colliculus, and cerebellum anteroom. In lower vertebrates (fish,
amphibians, birds), the colliculus perceives form and movement; in higher 
animals, it is subdivided into a superior colliculus, which determines the direc- 
tion of gaze under the general control of the cerebral cortex, and an inferior 
colliculus, which is probably concerned with auditory and vestibular inputs as 
well as the somatic body image. The base around the colliculus is the tegmen- 
tum. 

Around the reticular core are specialized structures that could also be 
called computers, such as the nucleus of nerve cells that control respiration 
and other routine bodily functions, and the dorsal horn of the spinal cord, 
through which pass inputs from sense cells. Note that the reticular core acts 
on all other computers and that they report to it. It reaches decisions with the
aid of raw data from sensory systems as well as processed data from other
computers, the latter constituting its primary input source.

The computers of Fig. 7-1 are shown as arranged in animals with horizontal 
spines. Monkeys and man have the same computers in approximately the same 
relation, but the arrangement is tilted, with the cerebrum, now very much 
larger, at the top. 

All these computers have a common ancestry. All evolved from the central
computer, the reticular core, and in so doing have established only those interconnections
necessary for efficient communication with "retic." In this manner
the reticular core has been constantly able to maintain the complexity necessary
to meet the aggregate demands of the entire system.
7.2 A Possible Engineering Equivalent

Figure 7-2 is a diagram equivalent to Fig. 7-1 but labelled with engineering
terms to suggest how the system can be simulated. In place of the retinae are
the cameras and the visual first-stage computer.

First-stage computers receive inputs from all of the senses, including the auditory,
vestibular and somatic sensory. Each is called a computer rather than a precomputer
or preprocessor to indicate that it receives feedback from the central
computers.

Other substitutions are as follows: 


    decision computer                                   for   reticular core
    associative computer                                for   cerebral cortex
    timing, coordinating and autocorrelating computer   for   cerebellum
    computer of effector sequences                      for   basal ganglia (nuclei)
    computer of specialized controls                    for   lateral reticular nuclei


7.3 Memory in Biological Computers 


If models of the computers described above are to be built, what will be the 
form of the memory? 

Computer designers recognize three principal forms, but there is a fourth 
more important for our present work. The first form is storage on tape where 
data is found by searching linearly. The second form is random access where 
data is found by its address. In the third form, data is found by searching for 
part or all of its content. This has unfortunately come to be called "associative" 
memory. 

The fourth form is the true associative memory employed in animal brains. 
Here data is found by a relation. For example, if you are looking for a friend in 
a crowd, you may find him by recognizing the proportions of his face or the time 
of day that he passes by. These are relations. You acquire this form of memory 
by constantly updating a model of the world in your head. A change in the model 
is considered to be the formation of a trace embodying the relation. (The connection
for the trace may have existed but have had a different threshold than
that required.) Conditioning, which is the most elementary method of forming
a trace, is described in Section 9. This form of memory will be simulated, at
first, by the other three. Hardware (or chemistry) more appropriate to a true
associative memory may evolve.
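
The four forms of retrieval can be contrasted in a short sketch. The records, field names, and function names below are all hypothetical, and the fourth function only simulates retrieval by relation with an exhaustive search, in the spirit of the statement that this form will at first be simulated by the other three.

```python
# Hypothetical records; the field names are illustrative only.
store = [
    {"name": "rock A", "width": 2.0, "height": 1.0, "reflectance": 0.12},
    {"name": "rock B", "width": 1.0, "height": 3.0, "reflectance": 0.15},
    {"name": "sand", "width": 9.0, "height": 0.1, "reflectance": 0.46},
]


def tape_search(tape, name):
    """Form 1: data found by searching linearly along a tape."""
    for position, record in enumerate(tape):
        if record["name"] == name:
            return position, record
    return None


def random_access(memory, address):
    """Form 2: data found by its address."""
    return memory[address]


def content_search(memory, **partial):
    """Form 3: data found by part or all of its content."""
    return [r for r in memory if all(r.get(k) == v for k, v in partial.items())]


def relational_search(memory, relation):
    """Form 4 (simulated): data found by a relation among its parts."""
    return [r for r in memory if relation(r)]


# A relation, here a proportion, rather than any stored value.
tall = relational_search(store, lambda r: r["height"] / r["width"] > 2.0)
print([r["name"] for r in tall])
```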
SECTION 8


VISUAL PROCESSING IN ANIMALS 
by 

Roberto Moreno-Diaz 


8.1 Development of a Computational Model 

Existing models of visual processing in the frog's retina and tectum (Refs. 
17, 21, 22) suggest a concept which may be applied to retinas of higher animals 
as well. It is hoped that a model of this concept will soon be realized which, 
founded in known anatomy and physiology, may be further extended to apply to 
tasks such as those described in Section 4 of this report. 

The concept of visual processing being developed may be regarded as a
layered computation; that is, elements of a similar nature comprise a given 
layer, and computation occurs first through interconnections between elements 
in the same layer, then between those of different layers. In a gross sense, a 
succession of such layers will come to represent the higher animal retina. Al- 
though there are only three significant cell types in vertebrate retinae (photo- 
receptor, bipolar, ganglion), the number of computational layers is larger, 
since interaction may occur between dendrites at more than one level. 

What is the nature of computation in each layer? Physiology suggests
the possibility, at the photoreceptor-bipolar level, of lateral inhibition. Assume
a mosaic of photoreceptors which feed bipolar cells, such that a particular
bipolar receives inputs from an area containing many photoreceptors
by means of photoreceptor ends, or "fibers." Lateral inhibition may then
be formulated as follows. Consider a typical bipolar cell, called "B" in
Fig. 8-1. It receives both an excitatory signal from the photoreceptor, or
photoreceptors, situated directly above it as well as inhibitory signals from the
surroundings. If the effect of the inhibition is subtractive, the total signal arriving
at the bipolar B would be

$$B = \sum_{A_1} F_E - \sum_{A_2} F_I \qquad (8\text{-}1)$$

Fig. 8-1. Dendritic level, defined by areas $A_1$ and $A_2$,
with dendrites leading to cell body B.


where $F_E$ is the output of a photoreceptor transmitting an excitatory signal, $F_I$
the output of an "inhibitory" photoreceptor, and $A_1$ and $A_2$ the areas of excitatory
and inhibitory photoreceptors respectively.


Equation (8-1) refers to interaction only between the photoreceptor-bipolar
levels. Regardless of the location of such interaction in the retina, it is possible
to postulate a corresponding layer which produces the same effect. Let $F(\alpha,\beta)$
be the output of an ideal photoreceptor situated at the point $(\alpha,\beta)$ (in a suitable
coordinate frame superimposed over the retina). This corresponds to the input
of an idealized computational layer, called a layer of lateral inhibition. Let
B(x,y) be the output of that layer corresponding to the point (x,y). The transformation
rule that defines the computation at that layer is then

$$B(x,y) = \frac{1}{N_1}\sum_{(\alpha,\beta)\in A_1} F(\alpha,\beta) - \frac{1}{N_2}\sum_{(\alpha,\beta)\in A_2} F(\alpha,\beta) \qquad (8\text{-}2)$$

where summation is over all points $(\alpha,\beta)$ in areas $A_1$ and $A_2$, and $N_1$ and $N_2$
correspond, respectively, to the number of elements in $A_1$ and the number in $A_2$.

The values of the coordinate variables x, y, $\alpha$, and $\beta$, according to the discrete
nature of the retinal mosaic, are first considered discontinuous, taking, for
example, values 1, 2, 3, .... If the area $A_1$ is reduced to a single element, that
at position (x,y), Eq. (8-2) becomes

$$B(x,y) = F(x,y) - \frac{1}{N_2}\sum_{(\alpha,\beta)\in A_2} F(\alpha,\beta) \qquad (8\text{-}3)$$

If the elements of the retina are sufficiently small and numerous, the summation
may be replaced by an integral, to give

$$B(x,y) = F(x,y) - \frac{1}{A_2}\iint_{A_2} F(\alpha,\beta)\,d\alpha\,d\beta \qquad (8\text{-}4)$$
This transformation is known to produce contrast enhancement
(Ref. 23) and redundancy reduction of the initial image.

In the above formulation, variables B and F are considered time inde- 
pendent. This restriction will be eliminated in future modeling to take time 
into account. 

. ti t Ii ; f it 

Equation (8-2) defines computation over specific areas, giving equal weight
to each photoreceptor in a field. That equation may be generalized by introducing
a set of coefficients which reflect the relative strength of a particular input
fiber connected to the output fiber under consideration. Let $C(x,y,\alpha,\beta)$ be the
strength of the input in position $(\alpha,\beta)$ over the output in position (x,y). Equation
(8-2) then becomes:

$$B(x,y) = \sum_{R} C(x,y,\alpha,\beta)\,F(\alpha,\beta) \qquad (8\text{-}5)$$

where the summation is over the entire retina.

That Eqs. (8-2) and (8-3) are particular cases of Eq. (8-5) is easily verified.
To obtain Eq. (8-2), let $C(x,y,\alpha,\beta)$ be $+1/N_1$ over the area $A_1$, $-1/N_2$
over $A_2$, and zero elsewhere. To obtain Eq. (8-3), let $C(x,y,\alpha,\beta)$
be $+1$ for $C(x,y,x,y)$, $-1/N_2$ over $A_2$, and zero elsewhere.
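
This verification can also be carried out numerically. The sketch below is an illustration under stated assumptions, not the simulation of Section 4: the retina is a small rectangular grid, the inhibitory area $A_2$ is taken to be the (up to eight) immediate neighbours of each point, clipped at the border, and all function names are hypothetical. It computes Eq. (8-3) directly and again through the general form of Eq. (8-5) with the weights just described.

```python
def surround(x, y, rows, cols):
    """Assumed inhibitory area A_2: immediate neighbours of (x, y) inside the retina."""
    return [(a, b)
            for a in range(x - 1, x + 2)
            for b in range(y - 1, y + 2)
            if (a, b) != (x, y) and 0 <= a < rows and 0 <= b < cols]


def lateral_inhibition(F):
    """Eq. (8-3): centre output minus the mean output over A_2."""
    rows, cols = len(F), len(F[0])
    B = []
    for x in range(rows):
        row = []
        for y in range(cols):
            A2 = surround(x, y, rows, cols)
            row.append(F[x][y] - sum(F[a][b] for a, b in A2) / len(A2))
        B.append(row)
    return B


def inhibition_weights(rows, cols):
    """Weights C(x, y, a, b): +1 at the centre, -1/N_2 over A_2, zero elsewhere."""
    def C(x, y, a, b):
        if (a, b) == (x, y):
            return 1.0
        A2 = surround(x, y, rows, cols)
        return -1.0 / len(A2) if (a, b) in A2 else 0.0
    return C


def lateral_interaction(F, C):
    """Eq. (8-5): weighted sum over the entire retina R."""
    rows, cols = len(F), len(F[0])
    return [[sum(C(x, y, a, b) * F[a][b] for a in range(rows) for b in range(cols))
             for y in range(cols)]
            for x in range(rows)]


# A vertical edge: dim on the left, bright on the right.
F = [[1.0, 1.0, 1.0, 3.0, 3.0, 3.0] for _ in range(4)]
direct = lateral_inhibition(F)
general = lateral_interaction(F, inhibition_weights(4, 6))
print(direct[1])
```

Away from the edge the output is zero, while on either side of it the output swings negative and positive, which is the contrast enhancement referred to in the text.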

Equation (8-5) is called the general expression for lateral interaction, of
which lateral inhibition is a specific case. In the continuous retina, Eq. (8-5)
becomes

$$B(x,y) = \iint_{R} C(x,y,\alpha,\beta)\,F(\alpha,\beta)\,d\alpha\,d\beta \qquad (8\text{-}6)$$

Theoretical work continues at present to investigate various forms of the 
weighting coefficients to obtain different processes. It has been found that 
Eq. (8-6) explains several properties of the visual system — such as reliability, 
in the sense that the destruction of a fiber B(x, y) or a group of them, produces 
not blind spots but merely a general loss of resolution. Such points will be ex- 
amined in detail in future reports. 

Computations as expressed in the above equations are not easy to carry out
analytically for any but the simplest forms of $F(\alpha,\beta)$, which are not the most
interesting ones, but clues about the behavior of the transformation have been obtained
analytically through treating the simple one-dimensional case. For a real
image in the retina, however, the form of $F(\alpha,\beta)$ is extraordinarily complicated
and requires a simulation of the kind presented in Section 4.
8.3 Examples of Lateral Interaction


For the one-dimensional case, Eq. (8-4) becomes

$$B(x) = F(x) - \frac{1}{2s}\int_{x-s}^{x+s} F(x)\,dx$$

where 2s is the size of the inhibitory area, which in this case is a line. Statisticians
call the second term a "moving average". B(x) may be regarded as the result
of a transformation $T_s$ upon F(x). The transformations of some simple functions
are:

a) When F(x) = Ag(x), with A = constant, then the transformation is:

$$T_s(Ag(x)) = A\,T_s(g(x))$$

Proof:

$$T_s(Ag(x)) = Ag(x) - \frac{1}{2s}\int_{x-s}^{x+s} Ag(x)\,dx$$

$$= A\left[g(x) - \frac{1}{2s}\int_{x-s}^{x+s} g(x)\,dx\right]$$

$$= A\,T_s(g(x))$$

b) When F(x) = Ag(x) + Bh(x), with A, B constant, then $T_s(F(x))$ is:

$$T_s(Ag(x) + Bh(x)) = A\,T_s(g(x)) + B\,T_s(h(x))$$

as suggested by the above proof.

c) For F(x) = A, with A = constant,

$$T_s(A) = 0$$

d) For F(x) = Ax, with A = constant,

$$T_s(Ax) = 0$$

Proof of d:

$$T_s(Ax) = Ax - \frac{1}{2s}\int_{x-s}^{x+s} Ax\,dx$$

$$= Ax - \frac{1}{2s}\left[\frac{A(s+x)^2}{2} - \frac{A(s-x)^2}{2}\right]$$

$$= Ax - \frac{A}{4s}\left[s^2 + 2sx + x^2 - s^2 + 2sx - x^2\right]$$

$$= Ax - \frac{A}{4s}\,[4sx]$$

$$= 0$$
For x negative, the transformation of the above function is:

$$T_s(F(x)) = A + B - \frac{1}{2s}\int_{x-s}^{0}(A+B)\,dx - \frac{1}{2s}\int_{0}^{x+s} B\,dx$$

$$= \frac{A}{2}\left(1 + \frac{x}{s}\right), \qquad -s < x < 0$$


For x positive:

$$T_s(F(x)) = B - \frac{1}{2s}\int_{x-s}^{0}(A+B)\,dx - \frac{1}{2s}\int_{0}^{x+s} B\,dx$$

$$= \frac{A}{2}\left(\frac{x}{s} - 1\right), \qquad 0 < x < s$$


For x < -s or x > s,

$$T_s(F(x)) = 0$$

because the function is a constant.
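
These results can be checked numerically. The sketch below evaluates the one-dimensional transformation with a midpoint-rule integral (the number of sub-intervals is an arbitrary choice) and applies it to a constant, a ramp, and a step. The step is inferred from the integrands above, which imply F(x) = A + B for x < 0 and F(x) = B for x > 0; the values of A, B and s are hypothetical.

```python
def T(F, x, s, n=2000):
    """B(x) = F(x) - (1/2s) * integral of F over [x - s, x + s] (midpoint rule)."""
    h = 2.0 * s / n
    integral = sum(F(x - s + (k + 0.5) * h) for k in range(n)) * h
    return F(x) - integral / (2.0 * s)


A, B, s = 4.0, 1.0, 2.0


def constant(x):
    return A


def ramp(x):
    return A * x


def step(x):
    # Inferred from the integrands in the text: A + B left of the origin, B right of it.
    return A + B if x < 0 else B


print(T(constant, 0.3, s), T(ramp, 0.3, s))   # both vanish, cases (c) and (d)
print(T(step, -1.0, s), T(step, 1.0, s))      # overshoot and undershoot at the edge
print(T(step, 5.0, s))                        # zero far from the edge
```

The step produces a positive excursion just to the left of the edge and a negative one just to the right, each falling linearly to zero over a distance s, in agreement with the two expressions derived above.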
SECTION 9


LEARNING IN ANIMALS 

by 

Charles Sigwart 


9.1 General 

At the end of Section 7 memory traces in the nervous system were men- 
tioned as the embodiment of relations comprising a model of the environment. 

The possibility of changing such a model with new environmental situations im- 
plies memory and learning. Attempts to define learning and memory have oc- 
cupied the efforts of many neurophysiologists and psychologists for a long time. 
Thus there exists a body of literature concerning mathematical learning theory 
based on behavior and a range of hypotheses about the relation of learning and 
memory to structural, chemical and functional properties of the nervous system. 

Learning is a term which encompasses a range of phenomena extending from
simple changes in responsiveness to a stimulus to the very complex capacity for
dealing with abstractions possible in man. If we can accept a broad generality,

"We can define learning as that process which manifests itself by adaptive
changes in individual behavior as a result of experience..." (24)

Learning cannot be defined more precisely since mechanisms underlying 
many complex learning phenomena are not understood. The more elementary 
forms of learning such as habituation and conditioning have been subjected to 
neurophysiological study. In the following paragraphs of this section, various
concepts of learning that are relevant to, and desirable in, the behavior of a robot
will be described. A more detailed discussion of recent research on learning may
be found in Ref. 25 and Ref. 26.

9.2 Classes of Response 

The decision computer's responses to stimuli may be classed as either 
momentary or predictive. Those which are momentary are a consequence only 
of the stimuli which they directly follow; those which are predictive occur when 
the decision computer receives a stimulus recognized to be a precursor of a 
later event and acts in accordance with that event. STL-RETIC will implement
momentary response properties such as habituation and classical conditioning
as well as predictive responses by incorporating operant conditioning and generalization.
Implicit in providing the capacity to change response patterns is
the ability to modify generalizations when they become inappropriate.

9.3 Classical Conditioning 

Classical conditioning (also called Pavlovian and Type 1 conditioning) involves 
the association of a reflex response with a new stimulus. For this to occur the 
new stimulus which is to become a conditioned stimulus (CS) must regularly pre- 
cede the unconditioned stimulus (US), which normally produces the unconditioned 
response (UR). It is assumed that the 'stimulus' may include many factors in a
vertebrate animal.

"Sensory signalling that takes place during conditioning involves
at least two differentiable sensory channels: specific and nonspecific.
The nonspecific channel apparently establishes background conditions
for widespread reception and retention of particular sensory and motor
signals which may enter by way of specific and nonspecific channels...
The net result of all this is an alteration in distribution and influence
of incoming and outgoing signals which increases the likelihood that
certain responses will occur in relation to certain previously experienced
stimuli." (27)

A working definition for classical conditioning is as follows:

"A stimulus class (CS) that before the conditioning procedure does
not produce a response, or produces it with low incidence will, after
the conditioning procedure, produce the response with a frequency
above some specified criterion. The conditioning procedure consists
in pairing of the CS with another stimulus (US) that ordinarily produces
an unconditioned response (UR). The paired presentations will be in a
certain order (CS - US) with a CS-US interval with specified limits
both with respect to mean duration and variability. The CR must meet
criteria of resemblance to the UR."

The production of new relations between a stimulus situation and a response 
may be considered as a memory allowing the building up of a class of stimulus 
situations to which a response is elicited. 

9.4 Habituation 

Habituation is "...the relatively persistent waning of a response as a result
of repeated stimulation which is not followed by any kind of reinforcement. It
is specific to the stimulus and relatively enduring... distinct from fatigue and
sensory adaption." It is assumed to be a central phenomenon, exclusive of
receptor or effector fatigue.

The main function of habituation is to eliminate insignificant responses by 
reducing sensitivity to a specific stimulus which is repetitive. The man who 
hears a jack hammer going in the street stops paying attention to it after a 
while. Similarly the decision computer should ignore stimuli which it finds 
unimportant. When the robot does something new in its environment, or when 
that environment changes, such stimuli will occur and the decision computer 
should eradicate their traces as they enter over sensory subsystems. 
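As an illustration only, the behavior asked of the decision computer here can be sketched in Python. The class name, the decay factor and the response threshold are assumptions made for this sketch; none of them is taken from the report or from the decision computer's program.

```python
class Habituator:
    """Stimulus-specific waning of a response under unreinforced repetition.

    Hypothetical model: each stimulus has a strength that starts at 1.0 and is
    multiplied by `decay` every time it recurs without reinforcement.
    """

    def __init__(self, decay=0.7, threshold=0.2):
        self.decay = decay          # assumed fractional strength kept per repetition
        self.threshold = threshold  # assumed strength below which no response occurs
        self.strength = {}          # per-stimulus strength; habituation is stimulus-specific

    def present(self, stimulus, reinforced=False):
        """Return True if the stimulus still evokes a response."""
        current = self.strength.get(stimulus, 1.0)
        responds = current >= self.threshold
        # Reinforcement restores full sensitivity; otherwise the response wanes.
        self.strength[stimulus] = 1.0 if reinforced else current * self.decay
        return responds


robot = Habituator()
responses = [robot.present("jack hammer") for _ in range(8)]
novel = robot.present("falling rock")  # a new stimulus is unaffected
print(responses, novel)
```

With these assumed constants the repeated stimulus stops evoking a response after a few presentations, while a stimulus not met before still does.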

9.5 Discriminative Learning 

The capacity to generalize, given a set of stimuli appropriate to a particular 
response, so as to include in that set related stimuli, is known as discriminative, 
or perceptual, learning. It is directly involved with conditioning and habituation, 
in that effective response to an appropriate stimulus requires correlated sensory 
data. 

Since changes resulting from generalization are reversible, those that are 
inappropriate should be unlearned as soon as a contradiction arises between re- 
sults of behavior and expectation. Thus, the capacity to generalize depends on 
perceptual discrimination as well as information from the senses. 

9.6 Instrumental Conditioning 

Instrumental conditioning (also known as operant or Type II conditioning or 
trial and error learning) involves the modification of a response pattern dependent 
on a reinforcement. This is in contrast to classical conditioning, where 
reinforcement shapes the selection of the appropriate stimulus. Instrumental 
conditioning and its variants involve modification of a response that was initiated by 
the animal. In essence the animal is changing the probability of a particular 
response to a given situation consequent upon the reinforcing result, whether 
painful or pleasurable. Thus this type of conditioning includes avoidance conditioning, 
where a potential hazard is recognized and elicits a response to avoid it. 

It is clearly important for the robot to eliminate undesirable responses to 
hazards. A task of the decision computer must be to seek other responses to 
lessen hazards in situations it encounters. 
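A minimal sketch of such a change in response probability follows, written in Python for illustration. The update rule (multiply the weight of the response just made, then renormalize), the learning rate and the response names are all assumptions of this sketch and are not the method of the decision computer.

```python
def reinforce(probabilities, response, outcome, rate=0.5):
    """Return new response probabilities after one reinforced trial.

    probabilities: dict mapping response name -> probability in one situation.
    outcome > 0 is a pleasurable result, outcome < 0 a painful one.
    The multiplicative rule and `rate` are hypothetical choices.
    """
    weights = dict(probabilities)
    if outcome > 0:
        weights[response] *= 1.0 + rate   # make the rewarded response more likely
    elif outcome < 0:
        weights[response] *= 1.0 - rate   # make the punished response less likely
    total = sum(weights.values())
    return {name: w / total for name, w in weights.items()}


# Hypothetical responses available to the robot at the edge of a crevice.
p = {"advance": 0.5, "halt": 0.25, "detour": 0.25}
for _ in range(2):
    p = reinforce(p, "advance", outcome=-1)   # advancing proved hazardous twice
print(p)
```

After two painful trials the hazardous response has become less probable and the alternatives more probable, which is the sense in which the decision computer would "seek other responses."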





SECTION 10 


SUMMARY AND CONCLUSION 


The search for evidence of life and geological changes on Mars we interpret 
as a problem in reducing the data collected initially to a smaller amount of 
significant data to be sent to Earth, and of changing the mode of operation of the 
data-acquiring device (robot). Reducing the amount of visual data is called 
visual processing and modelling; changing the mode of operation, decision. We 
aim by data reduction to employ the limited transmitted information more ef- 
ficiently than by conventional TV. By permitting the robot to change its mode of 
operation we aim to enable it to decide what to do next. 

Components have been described which, when tied together, will comprise 
the visual subsystem of a robot. The computer-controlled television camera is 
sufficiently open in its design to permit analysis and resynthesis as demands 
upon it change. A succession of stereo-optics designs, from C1 through D1 to 
E1, has led back to a type with fixed convergence, called C2, as the one most 
likely to be developed next. Means have been devised of processing stereo pairs 
of pictures to extract range. Two structures have been planned to store a model 
of the environment, one employing points, the other points and reflectances. 

Data reduction is achieved first in the formation of a model and second in the 
selection of parts of that model (interrogation) for transmission to Earth. One 
purpose of this interrogation is to search for evidence of life and geological 
changes. Another is to find paths that the robot can follow. 

The contact subsystem is a hand-and-arm and a means of modelling the 
environment contacted. The hand is being planned at present both to feel the 
shape of objects and to provide a signal, such as a flashing cross, on which the 
cameras can converge at the start of the exploration of a scene. 

The decision subsystem, previously devised to respond to a sequence of 
static environments, is now being augmented by logical and storage elements so 
that it can learn. The structure of this subsystem is such that while it is now in 
the form of a computer program it can later be implemented in special-purpose 
hardware. 

The mobility subsystem is at present a vehicle, wheel drives and controls, 
originally designed for a lunar landing by the General Motors Defense Research 
Laboratory and since adapted by them for a Martian landing. The synthesis of 
visual, contact, decision and mobility subsystems we call a robot. Three 
configurations of this robot have been pictured, the preferred one having the camera 
assembly directly above the front axle of the vehicle. All are capable of mapping 
by measuring distances with an odometer and angles with a radio direction finder 
aimed at transponders previously dropped by the robot. Accuracy can be im- 
proved by sighting landmarks. 

While this program is expected to be primarily one of computer development, 
only commercially-available computers are used at present. The purpose of 
this is to determine by simulation on existing machines the specifications of the 
desired machine. 

The first part of this report concerns the developments just summarized. 
The second part describes research into the organization of animal nervous 
systems, into the nature of visual processing and the nature of learning in animals. 
The organization of animal nervous systems is that found by Warren McCulloch 
and presented here in computer language. Visual processing is that found by 
physiologists and presented here analytically by Roberto Moreno-Diaz. "Learn- 
ing in animals" is a summary of this subject as presently understood in psychology 
and physiology. These investigations will be the basis of further development of 
the kind described in the first part of the report. 



Note to Section 3 


APPENDIX A 

MEASUREMENT OF VIDICON RESOLUTION 
by 

David Tweed 


The method employed in the measurement of vidicon resolution is that 
described in Ref. 25. Figures A-2 and A-3 in that reference are replaced 
respectively by a photograph of an oscilloscope presentation of the output of the C1-1 
camera chain and a graph derived from this presentation. Quoting from Ref. 25: 

"A new type of test pattern was devised (Fig. A-1) which consists of 
nine line-groups; each of these groups contains a black and white bar to 
represent 100% response factor, followed by ten sets of four black and three 
white lines which represent 100 to 1000 TV lines (across the width of the 
chart), in 100-line increments. 

"This new chart makes it possible to obtain data for a complete 
square-wave aperture response curve with one oscilloscope presentation. Figure 
A-2 is typical of the type of presentation that is obtained from an oscilloscope 
that is fed the video signal and set to select one horizontal scan line when an 
image from the line selector pattern is being transduced. 

"The procedure for obtaining the aperture response curve is as follows: 

1. The tube is set up under the desired test conditions, in this case 
optimum overall performance. 

2. The test pattern is the "line selector" pattern of Fig. A-1. 

3. The video signal from a selected horizontal line is presented on an 
oscilloscope and a photograph is taken. See Fig. A-2. 

4. The photograph is measured for the relative response of the 
line-sets to that of the black and white bars, and a curve is plotted. 
See Fig. A-3." 

Fig. A-1. "Line Selector" test pattern (Westinghouse resolution chart ET-1332 purchased from 
Tele-Measurements Inc., 145 Main Ave., Clifton, N.J.), reproduced 1/3 full size. 
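Step 4 of the quoted procedure amounts to a ratio of amplitudes, and can be sketched as follows. This is an illustrative Python sketch only: the function name and the amplitude readings are hypothetical, and no measured values from the C1-1 camera chain are implied.

```python
def aperture_response(bar_amplitude, line_set_amplitudes, first=100, step=100):
    """Return (TV lines, response factor in percent) pairs.

    bar_amplitude: amplitude read from the photograph for the black and
        white bar, taken as 100% response factor.
    line_set_amplitudes: amplitudes of the successive line-sets, assumed to
        run from `first` TV lines upward in increments of `step`.
    """
    if bar_amplitude <= 0:
        raise ValueError("bar amplitude must be positive")
    return [(first + i * step, 100.0 * amplitude / bar_amplitude)
            for i, amplitude in enumerate(line_set_amplitudes)]


# Hypothetical readings (oscilloscope divisions) for the ten line-sets.
readings = [4.0, 3.8, 3.4, 2.9, 2.3, 1.7, 1.2, 0.8, 0.5, 0.3]
for tv_lines, percent in aperture_response(4.0, readings):
    print(f"{tv_lines:5d} TV lines: {percent:5.1f}%")
```

Plotting the second element of each pair against the first gives a square-wave aperture response curve of the kind the procedure describes.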



Fig. A-2. Tracing of oscilloscope presentation of video signal from part of single scan 
line of Fig. A-1, employing the amplifier described in subsection 3.5. 
[Figure label: lines imaged on TV camera frame.] 

Note to Section 3 


APPENDIX B 

RANGING AND RANGE ERROR 

by 

Charles Sigwart 


B.1 General 

To obtain the range of objects in the environment it is possible to use a 
stereo optical system and in essence triangulate. Thus the range z to point P is 
determined by measuring α₁ and α₂, knowing that the base separation of the two 
optical systems A₁ and A₂ is 2b. The measurement of range is uncertain to the 
extent that the measurement of convergence angles α₁ and α₂ is uncertain. In 
general there will necessarily be some small angular uncertainty ±δ in measuring 
the orientation of the optic axis. Thus the localization of P by this method is in 
a volume bounded by the intersection of the two cones whose axes intersect at P. 
The cones have a half angle δ as shown in the plane of the axes in Fig. B-1. 

The intersection of the plane of the optic axes with this volume produces the 
quadrilateral of uncertainty surrounding P. 
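The triangulation and the quadrilateral of uncertainty can be sketched numerically. The following Python sketch assumes that each convergence angle is measured inward from the forward direction and that the two optical systems lie at lateral positions -b and +b; the function names and the numerical values are hypothetical.

```python
import math


def triangulate(b, alpha1, alpha2):
    """Locate P in the plane of the optic axes.

    Assumed convention: A1 at lateral position -b, A2 at +b, each angle
    (radians) measured inward from the forward direction.
    Returns (lateral offset x, range z).
    """
    z = 2.0 * b / (math.tan(alpha1) + math.tan(alpha2))
    x = -b + z * math.tan(alpha1)
    return x, z


def uncertainty_corners(b, alpha1, alpha2, delta):
    """Corners of the quadrilateral obtained by shifting each angle by +/- delta."""
    return [triangulate(b, alpha1 + s1 * delta, alpha2 + s2 * delta)
            for s1 in (-1, 1) for s2 in (-1, 1)]


b, q = 0.1, 2.0                      # hypothetical half-base and range, meters
alpha = math.atan(b / q)             # equal convergence angles for a point on the center line
print(triangulate(b, alpha, alpha))
print(uncertainty_corners(b, alpha, alpha, math.radians(0.1)))
```

For equal angles the sketch returns a point on the center line at the chosen range, and the four corners bracket that range on the near and far sides.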



Fig. B-1. Quadrilateral of uncertainty surrounding point P when viewed by 
cameras at A₁ and A₂. 

This zone of uncertainty varies with α₁ and α₂ as indicated in Fig. B-2. Thus 
it is apparent that ranging on objects along the line q₀ away from the z axis 
results in an increasing uncertainty of measurement and that the range error is 
minimized by restricting the convergence angles such that α₁ = α₂. The effect 
shown in Fig. B-2 is of course amplified by the large angular uncertainty δ 
shown, but the general result is that range uncertainty increases significantly 
with deviation from the condition of equal convergence angles. Thus the 
following discussions will deal with range uncertainties for points on the center line. 


B.2 Range and Uncertainty for Points on the Q Axis 

To keep the range uncertainty minimal as indicated above, we will consider 
only the range of points on the Q axis, employing equations derived by Roberto 
Moreno-Diaz and David Tweed. The range of an object at P, shown as OP in 
Fig. B-3, is determined from 

$$ q_0 = \frac{b}{\tan\alpha} \tag{B-1} $$

The range is measured from the midpoint of a line of length 2b connecting the yaw 
axes of the imaging system. Each yaw axis intersects an optic axis at point A, 
a distance x from the image plane. The convergence angle α is measured from the 
forward direction (parallel to Q). N is the nodal point of the lens. 

Considering the similar triangles in Fig. B-3, △MCN ~ △SEN, we see that 
MC/CN = ES/EN, or 

$$ \frac{q_0 + \Delta q - (f - x)\cos\alpha}{b - (f - x)\sin\alpha} = \frac{f\cos\alpha + s\sin\alpha}{f\sin\alpha - s\cos\alpha} \tag{B-2} $$

The range uncertainty in the direction of increasing q is denoted +Δq, shown as 
PM. Thus from Eqs. (B-1) and (B-2) we obtain 

$$ +\Delta q = \frac{bs - (f - x)\,s\sin\alpha}{\sin\alpha\,(f\sin\alpha - s\cos\alpha)} \tag{B-3} $$

Similarly a displacement in the image plane in the opposite sense, -s, gives the 
uncertainty in range in the -Q direction from P: 

$$ -\Delta q = \frac{-bs + (f - x)\,s\sin\alpha}{\sin\alpha\,(f\sin\alpha + s\cos\alpha)} \tag{B-4} $$

The second term of the numerator vanishes when the yaw axis passes through the 
point on the optic axis a distance x = f in front of the image plane, since then 
(f - x) = 0. 

It can be shown that range uncertainty increases approximately with the 
square of the range. 
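The growth of the uncertainty with range can be checked numerically from Eqs. (B-3) and (B-4). The Python sketch below is illustrative: the function name is hypothetical, and the values of b, f and s are assumed for the example and are not those of the text or of Table 3-2.

```python
import math


def range_uncertainty(q0, b, f, s, x=None):
    """Return (+dq, -dq) for a point on the Q axis, per Eqs. (B-3) and (B-4).

    q0: range, b: half the separation of the yaw axes, f: focal length,
    s: image-plane displacement equivalent to the angular uncertainty,
    x: distance from image plane to yaw axis (defaults to f, so f - x = 0).
    All lengths in the same units.
    """
    if x is None:
        x = f
    alpha = math.atan2(b, q0)            # from q0 = b / tan(alpha)
    sin_a, cos_a = math.sin(alpha), math.cos(alpha)
    numerator = b * s - (f - x) * s * sin_a
    plus = numerator / (sin_a * (f * sin_a - s * cos_a))
    minus = -numerator / (sin_a * (f * sin_a + s * cos_a))
    return plus, minus


# Hypothetical values: b = 0.1 m, f = 0.04 m, s = 0.00001 m.
for q0 in (0.5, 1.0, 2.0, 3.0):
    plus, minus = range_uncertainty(q0, 0.1, 0.04, 1e-5)
    print(f"q0 = {q0:3.1f} m   +dq = {plus:.4f} m   -dq = {minus:.4f} m")
```

With these assumed values, doubling the range multiplies the uncertainty by roughly four, in keeping with the approximate square-law statement above.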

Substitutions into Eqs. (B-3) and (B-4), noting that α = tan⁻¹(b/q₀), using values 
of q₀ (from 0 to 3 meters at intervals of 0.5 meter), b and f from the text and 
Δs from Table 3-2, leads to the range uncertainty of camera assemblies D1 and 
E1 tabulated in Table B-1 and plotted in Figs. 3-5 and 3-11. 

Fig. B-2. Quadrilaterals of uncertainty for points on the line q₀. 




SYMBOL DEFINITIONS FOR FIG. B-3 

A = POSITION OF YAW AXIS OF CAMERA 
AI = x, SEPARATION OF IMAGE PLANE AND YAW AXIS ALONG THE OPTICAL AXIS 
AO = b, ONE HALF OF SEPARATION OF YAW AXES 
I = POINT ON IMAGE PLANE PIERCED BY OPTIC AXIS 
IN = f, FOCAL LENGTH OF LENS SYSTEM 
IP = OPTIC AXIS 
IS = s, A DISPLACEMENT ON THE IMAGE PLANE EQUIVALENT TO ANGLE δ 
PO = q 
PM = +Δq 
PR = -Δq 
α = CONVERGENCE ANGLE 
δ = UNCERTAINTY OF ANGLE α 

Fig. B-3. Geometry of variable convergence optics for range finding. 

B.3 Systems with Parallel Optic Axes 


When the optic axes are held parallel to the Q axis, as in camera assembly 
C1, α = 0, and an object at a point P on the Q axis will be represented by 
corresponding points S in the image planes. Uncertainties ±δ in the position angle 
will produce equivalent displacements ±Δs in the image plane. 

The range q of a point P, as seen in Fig. B-4, may be determined as follows. 
Since △MOS ~ △NAS, 

$$ q = \frac{f(b + s)}{s} \tag{B-5} $$

The range q + Δq of a point M is 

$$ q + \Delta q = \frac{f\,[\,b + (s - \Delta s)\,]}{s - \Delta s} \tag{B-6} $$

Subtracting Eq. (B-5) from Eq. (B-6) we obtain the range uncertainty Δq due to 
uncertainty in a position S on the image plane (or equivalently an angular 
uncertainty δ in determining β): 

$$ \Delta q = \frac{f\,b\,\Delta s}{s\,(s - \Delta s)} $$

and from Eq. (B-5) 

$$ s = \frac{fb}{q - f} $$

so that 

$$ \Delta q = \frac{\Delta s\,(q - f)^2}{fb - \Delta s\,(q - f)} \tag{B-7} $$

Thus it is seen that the range uncertainty increases approximately with the square 
of the range. Also we see that a range uncertainty, in the direction of increasing 
range, corresponds to an uncertainty in s towards the optic axis. 

Substitution into Eq. (B-7) of values of q (from 0 to 3 meters at intervals of 
0.5 meter), b and f from the text and Δs from Table 3-2, leads to the range 
uncertainty of camera assembly C1 tabulated in Table B-1 and plotted in Fig. 3-5. 
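Equation (B-7) can be checked against the difference of Eqs. (B-6) and (B-5) with a short calculation. The Python sketch below is illustrative; the function names are hypothetical and the values of b, f and Δs are assumed for the example, not taken from the text or from Table 3-2.

```python
def image_offset(q, b, f):
    """Image-plane offset s of a point at range q, from Eq. (B-5): s = f b / (q - f)."""
    return f * b / (q - f)


def range_from_offset(s, b, f):
    """Range q from image-plane offset s, Eq. (B-5): q = f (b + s) / s."""
    return f * (b + s) / s


def parallel_axis_uncertainty(q, b, f, ds):
    """Range uncertainty for parallel optic axes, Eq. (B-7)."""
    return ds * (q - f) ** 2 / (f * b - ds * (q - f))


# Hypothetical values: b = 0.1 m, f = 0.04 m, ds = 0.00001 m.
b, f, ds = 0.1, 0.04, 1e-5
for q in (0.5, 1.0, 2.0, 3.0):
    s = image_offset(q, b, f)
    direct = range_from_offset(s - ds, b, f) - q      # Eq. (B-6) minus Eq. (B-5)
    print(f"q = {q:3.1f} m   Eq. (B-7): {parallel_axis_uncertainty(q, b, f, ds):.5f} m"
          f"   direct: {direct:.5f} m")
```

The two columns agree, since Eq. (B-7) is only an algebraic rearrangement of that difference.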


SYMBOL DEFINITIONS FOR FIG. B-4 

q = RANGE OF A POINT P FROM IMAGE PLANE 
A = POINT WHERE OPTIC AXIS PIERCES IMAGE PLANE 
s = DISTANCE ON THE IMAGE PLANE FROM POINT A TO IMAGE OF POINT P 
Δs = UNCERTAINTY IN THE POSITION OF POINT S CORRESPONDING TO THE UNCERTAINTY ANGLE δ 
+Δq = PM 
-Δq = PR 

Fig. B-4. Geometry of fixed optics for range finding. 

Table B-1. Range and range uncertainty for three camera-computer chains (meters). 

            C1-1             D1-3             E1₁-3            E1₂-3 
  q      +Δq     -Δq      +Δq     -Δq      +Δq     -Δq      +Δq     -Δq 
 0.5    .0417   .0366    .0032   .0031    .0048   .0048    .0011   .0011 
 1.0    .118    .0948    .0137   .0133    .0243   .0231    .0046   .0045 
 1.5    .384    .2041    .0320   .0306    .0553   .0515    .0104   .0102 
 2.0    .540    .350     .0578   .0548    .0997   .0908    .0209   .0205 
 2.5    .898    .494     .0922   .0845    .1321   .1195    .0288   .0280 
 3.0   1.431    .734     .1346   .1235    .1813   .1617    .0377   .0367 
 5.0   9.16    1.96      .3743   .3440    .6741   .5901    .1321   .1275 

Fig. B-5. Field of view of parallel optical systems. Shaded area is 
outside view of either system; heavy line bounds area 
viewed by both. 

B.4 Stereo Optics with Fixed Convergent Axes 

In a system with fixed convergence the field of view of the lenses will in 
general be limited. As shown in Fig. B-5, there will be some near point NP on 
the Q axis which is the minimum range at which a stereo image can occur. Thus 
there is a minimum distance at which the range of an object can be determined. 
This is at the point 

$$ q_{NP} = \frac{b}{\tan\gamma} $$

where γ is the half angle of the visible field of the lens system. When the optic 
axes are not parallel (α ≠ 0) but have fixed convergence α there may also be a 
maximum point at which stereo images can be obtained, FP in Fig. B-6. 
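For the parallel-axis case of Fig. B-5, the near point follows directly from b and the half angle of the field. A minimal Python sketch, with a hypothetical function name and assumed values of b and γ:

```python
import math


def near_point(b, gamma):
    """Minimum stereo range on the Q axis for parallel optic axes.

    b: half the separation of the optical systems,
    gamma: half angle of the visible field, in radians.
    """
    return b / math.tan(gamma)


# Hypothetical values: b = 0.1 m and a 10-degree half angle.
print(near_point(0.1, math.radians(10.0)))
```

A narrower field (smaller γ) pushes the near point farther out, which is why a limited field restricts ranging on nearby objects.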



Fig. B-6. Fields of view of optical systems whose axes converge at C. Since 
(α + γ) < 90° there is a point FP which is the maximum point for 
a stereo image. Area outside view of either system is shaded. 
Stereo field is bounded by heavy line. 

Note to Section 3 


APPENDIX C 

OTHER STEREO CAMERA CONFIGURATIONS 

by 

Joseph Convers 



Fig. C-1. Single vidicon camera in vertical attitude receiving information from a single 
nodding mirror and giving 1/2 of surface to each stereo axis. 

Fig. C-2. Two vidicon cameras in vertical attitude viewing mirrors which nod together 
and rotate individually for stereoscopy with azimuth freedom as in Figs. 3-6 and 3-10. 

Note to Section 3 


APPENDIX D 

PROPOSED OPTICAL TRAIN FOR CAMERA E1 

by 

Robert Magee 


The proposed optical train (Fig. D-1) is a dual system combining a 
wide-field camera with a high resolution but smaller-field camera. Both systems 
share the same objective lens by means of a beam splitting element. The 
physical size of the composite focal plane fits within a vidicon tube format size of 
approximately 0.5 inches by 0.5 inches. 


The wide angle objective lens consists of a scaled version of the Minolta 
MC-Rokkor PF lens. This lens is designed for a speed of f/1.4 and is a form of 
the Zeiss Biotar. The focal length has been reduced from 58 mm in the Minolta 
design to 40 mm for Camera E1. One particular advantage of the lens is its 
relatively long back focal length. This lens-to-focal-plane distance is necessary in 
the single lens reflex camera with which the Rokkor PF is used. Since the 
optical arrangement of Camera E1 poses design problems similar to those 
encountered with a single lens reflex, a logical choice for investigation was the 
form of lens used in the reflex camera. 


Fig. D-1. Proposed optical train for camera E1. Drawing to scale. Wide angle field 
is 20° by 10°; high resolution field is 2° by 1°. 

Light rays forming the wide angle image at the vidicon reflect from a beam 
splitter located to the right of the objective lens. Most of the light, however, is 
transmitted through the beam splitter and imaged at a field lens. A small central 
portion of this image is then relayed to the vidicon by means of a ten power 
microscope lens. This produces a ten to one scale relation between the two field 
images at the vidicon. The numerical aperture of the microscope lens is not 
adequate to handle all of the rays emanating from the f/1.4 objective lens and 
it appears that the latter should be stopped down to f/2. This lens would probably 
be used at f/2 for the high resolution field in any case because of the slight 
improvement in image quality. 

A number of right angle prisms and mirrors are then used to conduct the 
light from relay lens to vidicon. The composite field of view at the vidicon then 
appears similar to that shown in Fig. 3-8. It is important to keep vidicon 
signal levels from both fields of view roughly equivalent. To accomplish this, the 
beam splitter is left uncoated. A very thin, or pellicle, form of beam splitter 
appears advisable to minimize reflection interference from the second 
beam-splitter surface. 

Note to Section 5 


APPENDIX E 

CALCULATION OF THE DIRECTION COMPONENTS 
OF THE NORMAL TO TRIANGLE FGJ AND THE 
ANGLE BETWEEN THIS NORMAL AND SUNLIGHT. 

by 

Louis Sutro 

E.1 Location of Points and Triangles 

Figure E-1 is a plan view of the robot aimed at azimuth 43° and viewing 
the mound pictured in Figs. 2-3 and 5-1. Two reference lines have been drawn 
in the north and east directions from which measurements were made to 
determine the relative latitude and longitude of points F, G and J in Table 5-1. 

Figure E-2 is a side view employed by the artist to generate the perspective 
projections in Fig. 5-1. It is presented in lieu of a stereoscopic view which 
could communicate the spatial relations of all of these figures. 

Fig. E-1. Plan view of robot and the mound shown near the center of Fig. 2-3. 

E.2 Calculation of the Direction Components of the Normal to Triangle FGJ 

The equation of the plane through any three points (x₁, y₁, z₁), (x₂, y₂, z₂), 
and (x₃, y₃, z₃) is, in determinant form: 

$$ \begin{vmatrix} x & y & z & 1 \\ x_1 & y_1 & z_1 & 1 \\ x_2 & y_2 & z_2 & 1 \\ x_3 & y_3 & z_3 & 1 \end{vmatrix} = 0 \tag{E-1} $$

Substitution of the data of Table 5-1 yields the equation of the plane through 
points F, G, and J: 

$$ \begin{vmatrix} x & y & z & 1 \\ 0.14 & 3.45 & -0.09 & 1 \\ -0.46 & 3.74 & -0.25 & 1 \\ -0.14 & 4.64 & 0.28 & 1 \end{vmatrix} = 0 \tag{E-2} $$

This equation is of the form 

$$ Ax + By + Cz + D = 0 \tag{E-3} $$

where 

$$ A = \begin{vmatrix} 3.45 & -0.09 & 1 \\ 3.74 & -0.25 & 1 \\ 4.64 & 0.28 & 1 \end{vmatrix} = 0.2977 \tag{E-4} $$

$$ B = -\begin{vmatrix} 0.14 & -0.09 & 1 \\ -0.46 & -0.25 & 1 \\ -0.14 & 0.28 & 1 \end{vmatrix} = 0.2668 \tag{E-5} $$

$$ C = \begin{vmatrix} 0.14 & 3.45 & 1 \\ -0.46 & 3.74 & 1 \\ -0.14 & 4.64 & 1 \end{vmatrix} = -0.6328 \tag{E-6} $$

The components of a vector perpendicular to the above plane are proportional to 
A, B, and C. 
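The cofactor expansion behind A, B and C can be carried out in a few lines. The Python sketch below is illustrative: the function name is hypothetical, and the three points are the numerical rows of Eq. (E-2), assumed here to be F, G and J in that order.

```python
def plane_through(p1, p2, p3):
    """Return (A, B, C, D) of the plane Ax + By + Cz + D = 0 through three points.

    A, B and C are the cofactors of x, y and z in the first row of the
    4-by-4 determinant; D follows from requiring the plane to contain p1.
    """
    (x1, y1, z1), (x2, y2, z2), (x3, y3, z3) = p1, p2, p3
    a = y1 * (z2 - z3) - z1 * (y2 - y3) + (y2 * z3 - y3 * z2)
    b = -(x1 * (z2 - z3) - z1 * (x2 - x3) + (x2 * z3 - x3 * z2))
    c = x1 * (y2 - y3) - y1 * (x2 - x3) + (x2 * y3 - x3 * y2)
    d = -(a * x1 + b * y1 + c * z1)
    return a, b, c, d


# Rows of Eq. (E-2); the assignment of rows to F, G and J is an assumption.
F = (0.14, 3.45, -0.09)
G = (-0.46, 3.74, -0.25)
J = (-0.14, 4.64, 0.28)
A, B, C, D = plane_through(F, G, J)
print(f"A = {A:.4f}, B = {B:.4f}, C = {C:.4f}, D = {D:.4f}")
```

Substituting any of the three points into Ax + By + Cz + D gives zero to within rounding, which is a convenient check on the signs of the cofactors.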


E.3 Calculation of the Angle Between This Normal and Sunlight 

The angle between the direction of sunlight and the normal to the plane is 
computed from the equation 

$$ \cos\theta = \pm\frac{l_1 l_2 + m_1 m_2 + n_1 n_2}{\sqrt{l_1^2 + m_1^2 + n_1^2}\,\sqrt{l_2^2 + m_2^2 + n_2^2}} \tag{E-7} $$

where l₁, m₁, n₁ are the direction components of the normal to the surface and 
l₂, m₂, n₂ are the direction components of sunlight. Cos θ = 0.71. 
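Equation (E-7) is the usual normalized dot product and can be sketched as follows. The Python function name is hypothetical, and the sunlight direction used in the example is an assumed placeholder, since the direction components of sunlight are not reproduced in this appendix.

```python
import math


def cos_angle(v1, v2):
    """Cosine of the angle between two directions (Eq. E-7, taking the + sign)."""
    dot = sum(a * b for a, b in zip(v1, v2))
    norm1 = math.sqrt(sum(a * a for a in v1))
    norm2 = math.sqrt(sum(b * b for b in v2))
    return dot / (norm1 * norm2)


normal = (0.2977, 0.2668, -0.6328)   # proportional to A, B, C of Eqs. (E-4) to (E-6)
sun = (0.0, 0.0, -1.0)               # placeholder direction, not the report's value
print(cos_angle(normal, sun))
```

The ± sign in Eq. (E-7) reflects the freedom to take the normal on either side of the plane; the sketch returns the value for the directions as given.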